SEQUENCE LISTING

(2) INFORMATION FOR SEQ ID NO:1:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1350 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:


ATG GCT GAA ATT GTT GTC TCC AAA TTT GGC GGT ACC AGC GTA GCT GAT  48
Met Ala Glu Ile Val Val Ser Lys Phe Gly Gly Thr Ser Val Ala Asp
1               5                   10                  15

TTT GAC GCC ATG AAC CGC AGC GCT GAT ATT GTG CTT TCT GAT GCC AAC  96
Phe Asp Ala Met Asn Arg Ser Ala Asp Ile Val Leu Ser Asp Ala Asn
            20                  25                  30

GTG CGT TTA GTT GTC CTC TCG GCT TCT GCT GGT ATC ACT AAT CTG CTG  144
Val Arg Leu Val Val Leu Ser Ala Ser Ala Gly Ile Thr Asn Leu Leu
        35                  40                  45

GTC GCT TTA GCT GAA GGA CTG GAA CCT GGC GAG CGA TTC GAA AAA CTC  192
Val Ala Leu Ala Glu Gly Leu Glu Pro Gly Glu Arg Phe Glu Lys Leu
    50                  55                  60

GAC GCT ATC CGC AAC ATC CAG TTT GCC ATT CTG GAA CGT CTG CGT TAC  240
Asp Ala Ile Arg Asn Ile Gln Phe Ala Ile Leu Glu Arg Leu Arg Tyr
65                  70                  75                  80

CCG AAC GTT ATC CGT GAA GAG ATT GAA CGT CTG CTG GAG AAC ATT ACT  288
Pro Asn Val Ile Arg Glu Glu Ile Glu Arg Leu Leu Glu Asn Ile Thr
                85                  90                  95

GTT CTG GCA GAA GCG GCG GCG CTG GCA ACG TCT CCG GCG CTG ACA GAT  336
Val Leu Ala Glu Ala Ala Ala Leu Ala Thr Ser Pro Ala Leu Thr Asp
            100                 105                 110

GAG CTG GTC AGC CAC GGC GAG CTG ATG TCG ACC CTG CTG TTT GTT GAG  384
Glu Leu Val Ser His Gly Glu Leu Met Ser Thr Leu Leu Phe Val Glu
        115                 120                 125

ATC CTG CGC GAA CGC GAT GTT CAG GCA CAG TGG TTT GAT GTA CGT AAA  432
Ile Leu Arg Glu Arg Asp Val Gln Ala Gln Trp Phe Asp Val Arg Lys
    130                 135                 140

GTG ATG CGT ACC AAC GAC CGA TTT GGT CGT GCA GAG CCA GAT ATA GCC  480
Val Met Arg Thr Asn Asp Arg Phe Gly Arg Ala Glu Pro Asp Ile Ala
145                 150                 155                 160

GCG CTG GCG GAA CTG GCC GCG CTG CAG CTG CTC CCA CGT CTC AAT GAA  528
Ala Leu Ala Glu Leu Ala Ala Leu Gln Leu Leu Pro Arg Leu Asn Glu
                165                 170                 175

GGC TTA GTG ATC ACC CAG GGA TTT ATC GGT AGC GAA AAT AAA GGT CGT  576
Gly Leu Val Ile Thr Gln Gly Phe Ile Gly Ser Glu Asn Lys Gly Arg
            180                 185                 190

ACA ACG ACG CTT GGC CGT GGA GGC AGC GAT TAT ACG GCA GCC TTG CTG  624
Thr Thr Thr Leu Gly Arg Gly Gly Ser Asp Tyr Thr Ala Ala Leu Leu
        195                 200                 205

GCG GAG GCT TTA CAC GCA TCT CGT GTT GAT ATC TGG ACC GAC GTC CCG  672
Ala Glu Ala Leu His Ala Ser Arg Val Asp Ile Trp Thr Asp Val Pro
    210                 215                 220

GGC ATC TAC ACC ACC GAT CCA CGC GTA GTT TCC GCA GCA AAA CGC ATT  720
Gly Ile Tyr Thr Thr Asp Pro Arg Val Val Ser Ala Ala Lys Arg Ile
225                 230                 235                 240

GAT GAA ATC GCG TTT GCC GAA GCG GCA GAG ATG GCA ACT TTT GGT GCA  768
Asp Glu Ile Ala Phe Ala Glu Ala Ala Glu Met Ala Thr Phe Gly Ala
                245                 250                 255

AAA GTA CTG CAT CCG GCA ACG TTG CTA CCC GCA GTA CGC AGC GAT ATC  816
Lys Val Leu His Pro Ala Thr Leu Leu Pro Ala Val Arg Ser Asp Ile
            260                 265                 270

CCG GTC TTT GTC GGC TCC AGC AAA GAC CCA CGC GCA GGT GGT ACG CTG  864
Pro Val Phe Val Gly Ser Ser Lys Asp Pro Arg Ala Gly Gly Thr Leu
        275                 280                 285

GTG TGC AAT AAA ACT GAA AAT CCG CCG CTG TTC CGC GCT CTG GCG CTT  912
Val Cys Asn Lys Thr Glu Asn Pro Pro Leu Phe Arg Ala Leu Ala Leu
    290                 295                 300

CGT CGC AAT CAG ACT CTG CTC ACT TTG CAC AGC CTG AAT ATG CTG CAT  960
Arg Arg Asn Gln Thr Leu Leu Thr Leu His Ser Leu Asn Met Leu His
305                 310                 315                 320

TCT CGC GGT TTC CTC GCG GAA GTT TTC GGC ATC CTC GCG CGG CAT AAT  1008
Ser Arg Gly Phe Leu Ala Glu Val Phe Gly Ile Leu Ala Arg His Asn
                325                 330                 335

ATT TCG GTA GAC TTA ATC ACC ACG TCA GAA GTG AGC GTG GCA TTA ACC  1056
Ile Ser Val Asp Leu Ile Thr Thr Ser Glu Val Ser Val Ala Leu Thr
            340                 345                 350

CTT GAT ACC ACC GGT TCA ACC TCC ACT GGC GAT ACG TTG CTG ACG CAA  1104
Leu Asp Thr Thr Gly Ser Thr Ser Thr Gly Asp Thr Leu Leu Thr Gln
        355                 360                 365

TCT CTG CTG ATG GAG CTT TCC GCA CTG TGT CGG GTG GAG GTG GAA GAA  1152
Ser Leu Leu Met Glu Leu Ser Ala Leu Cys Arg Val Glu Val Glu Glu
    370                 375                 380

GGT CTG GCG CTG GTC GCG TTG ATT GGC AAT GAC CTG TCA AAA GCC TGC  1200
Gly Leu Ala Leu Val Ala Leu Ile Gly Asn Asp Leu Ser Lys Ala Cys
385                 390                 395                 400

GCC GTT GGC AAA GAG GTA TTC GGC GTA CTG GAA CCG TTC AAC ATT CGC  1248
Ala Val Gly Lys Glu Val Phe Gly Val Leu Glu Pro Phe Asn Ile Arg
                405                 410                 415

ATG ATT TGT TAT GGC GCA TCC AGC CAT AAC CTG TGC TTC CTG GTG CCC  1296
Met Ile Cys Tyr Gly Ala Ser Ser His Asn Leu Cys Phe Leu Val Pro
            420                 425                 430

GGC GAA GAT GCC GAG CAG GTG GTG CAA AAA CTG CAT AGT AAT TTG TTT  1344
Gly Glu Asp Ala Glu Gln Val Val Gln Lys Leu His Ser Asn Leu Phe
        435                 440                 445

GAG TAA  1350
Glu *
    450


(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG 


(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 
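
As a consistency check on the two 36-base oligonucleotides above, the following minimal Python sketch tests whether SEQ ID NO:2 and SEQ ID NO:3 can anneal to each other. The helper names `reverse_complement` and `annealed_region` are hypothetical and are not part of the listing; the sketch assumes unambiguous A/C/G/T bases only.

```python
# Hypothetical helpers; sequences are copied from SEQ ID NO:2 and SEQ ID NO:3.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def annealed_region(top: str, bottom: str) -> str:
    """Longest suffix of `top` that is a prefix of reverse_complement(bottom)."""
    rc = reverse_complement(bottom)
    for start in range(len(top)):
        if rc.startswith(top[start:]):
            return top[start:]
    return ""


SEQ2 = "GATCCATGGCTGAAATTGTTGTCTCCAAATTTGGCG"
SEQ3 = "GTACCGCCAAATTTGGAGACAACAATTTCAGCCATG"

duplex = annealed_region(SEQ2, SEQ3)
top_overhang = SEQ2[: len(SEQ2) - len(duplex)]     # unpaired 5' bases of SEQ ID NO:2
bottom_overhang = SEQ3[: len(SEQ3) - len(duplex)]  # unpaired 5' bases of SEQ ID NO:3
print(len(duplex), top_overhang, bottom_overhang)  # prints: 32 GATC GTAC
```

Under these assumptions the two oligonucleotides share a 32-base complementary region, and the first four bases of each strand (GATC and GTAC) would be left unpaired.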


(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 48

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 


(2) INFORMATION FOR SEQ ID NO:6: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..911 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC TTC GGC  47
   Met Ala Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly
   1               5                   10                  15

ACC GTT GGA GTA GCA ATG GTT ACT CCA TTC ACG GAA TCC GGA GAC ATC  95
Thr Val Gly Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp Ile
                20                  25                  30

GAT ATC GCT GCT GGC CGC GAA GTC GCG GCT TAT TTG GTT GAT AAG GGC  143
Asp Ile Ala Ala Gly Arg Glu Val Ala Ala Tyr Leu Val Asp Lys Gly
            35                  40                  45

TTG GAT TCT TTG GTT CTC GCG GGC ACC ACT GGT GAA TCC CCA ACG ACA  191
Leu Asp Ser Leu Val Leu Ala Gly Thr Thr Gly Glu Ser Pro Thr Thr
        50                  55                  60

ACC GCC GCT GAA AAA CTA GAA CTG CTC AAG GCC GTT CGT GAG GAA GTT  239
Thr Ala Ala Glu Lys Leu Glu Leu Leu Lys Ala Val Arg Glu Glu Val
    65                  70                  75

GGG GAT CGG GCG AAG CTC ATC GCC GGT GTC GGA ACC AAC AAC ACG CGG  287
Gly Asp Arg Ala Lys Leu Ile Ala Gly Val Gly Thr Asn Asn Thr Arg
80                  85                  90                  95

ACA TCT GTG GAA CTT GCG GAA GCT GCT GCT TCT GCT GGC GCA GAC GGC  335
Thr Ser Val Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly
                100                 105                 110

CTT TTA GTT GTA ACT CCT TAT TAC TCC AAG CCG AGC CAA GAG GGA TTG  383
Leu Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gln Glu Gly Leu
            115                 120                 125

CTG GCG CAC TTC GGT GCA ATT GCT GCA GCA ACA GAG GTT CCA ATT TGT  431
Leu Ala His Phe Gly Ala Ile Ala Ala Ala Thr Glu Val Pro Ile Cys
        130                 135                 140

CTC TAT GAC ATT CCT GGT CGG TCA GGT ATT CCA ATT GAG TCT GAT ACC  479
Leu Tyr Asp Ile Pro Gly Arg Ser Gly Ile Pro Ile Glu Ser Asp Thr
    145                 150                 155

ATG AGA CGC CTG AGT GAA TTA CCT ACG ATT TTG GCG GTC AAG GAC GCC  527
Met Arg Arg Leu Ser Glu Leu Pro Thr Ile Leu Ala Val Lys Asp Ala
160                 165                 170                 175

AAG GGT GAC CTC GTT GCA GCC ACG TCA TTG ATC AAA GAA ACG GGA CTT  575
Lys Gly Asp Leu Val Ala Ala Thr Ser Leu Ile Lys Glu Thr Gly Leu
                180                 185                 190

GCC TGG TAT TCA GGC GAT GAC CCA CTA AAC CTT GTT TGG CTT GCT TTG  623
Ala Trp Tyr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu
            195                 200                 205

GGC GGA TCA GGT TTC ATT TCC GTA ATT GGA CAT GCA GCC CCC ACA GCA  671
Gly Gly Ser Gly Phe Ile Ser Val Ile Gly His Ala Ala Pro Thr Ala
        210                 215                 220

TTA CGT GAG TTG TAC ACA AGC TTC GAG GAA GGC GAC CTC GTC CGT GCG  719
Leu Arg Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala
    225                 230                 235

CGG GAA ATC AAC GCC AAA CTA TCA CCG CTG GTA GCT GCC CAA GGT CGC  767
Arg Glu Ile Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gln Gly Arg
240                 245                 250                 255

TTG GGT GGA GTC AGC TTG GCA AAA GCT GCT CTG CGT CTG CAG GGC ATC  815
Leu Gly Gly Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gln Gly Ile
                260                 265                 270

AAC GTA GGA GAT CCT CGA CTT CCA ATT ATG GCT CCA AAT GAG CAG GAA  863
Asn Val Gly Asp Pro Arg Leu Pro Ile Met Ala Pro Asn Glu Gln Glu
            275                 280                 285

CTT GAG GCT CTC CGA GAA GAC ATG AAA AAA GCT GGA GTT CTA TAA TGAGAATTC  917
Leu Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu *
        290                 295                 300

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CTTCCCGTGA CCATGGGCCA TC 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CATGGCTGGC TTCCCCACGA GGAAGACCAA CAATGACATT ACCTCCATTG CTAGCAACGG 
TGGAAGAGTA CAATG 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CATGCATTGT ACTCTTCCAC CGTTGCTAGC AATGGAGGTA ATGTCATTGT TGGTCTTCCT 
CGTGGGGAAG CCAGC 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

CATGGCTTCC TCAATGATCT CCTCCCCAGC TGTTACCACC GTCAACCGTG CCGGTGCCGG 


CATGGTTGCT CCATTCACCG GCCTCAAAAG 



(2) INFORMATION FOR SEQ ID NO:11: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 

CATGCTTTTG AGGCCGGTGA ATGGAGCAAC CATGCCGGCA CCGGCACGGT TGACGGTGGT 
AACAGCTGGG GAGGAGATCA TTGAGGAAGC 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

CCGGTTTGCT GTAATAGGTA CCA 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

AGCTTGGTAC CTATTACAGC AAACCGGCAT G 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 


GCTTCCTCAA TGATCTCCTC CCCAGCT 



(2) INFORMATION FOR SEQ ID NO:15: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
CATTGTACTC TTCCACCGTT GCTAGCAA 


(2) INFORMATION FOR SEQ ID NO:16: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 70"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 


CTGACTCGCT GCGCTCGGTC 


(2) INFORMATION FOR SEQ ID NO:17: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
71" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 


TATTTTCTCC TTACGCATCT GTGC 


(2) INFORMATION FOR SEQ ID NO:18:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 78"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
TTCATCGATA GGCGACCACA CCCGTCC 


(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 79"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
AATATCGATG CCACGATGCG TCCGGCG 


(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..55 


(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 81"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CATGGAGGAG AAGATGAAGG CGATGGAAGA GAAGATGAAG GCGTGATAGG TACCG 55 


(2) INFORMATION FOR SEQ ID NO:21: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..55 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 80"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

AATTCGGTAC CTATCACGCC TTCATCTTCT CTTCCATCGC CTTCATCTTC TCCTC 55


(2) INFORMATION FOR SEQ ID NO:22: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 


(ii) MOLECULE TYPE: protein 


(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..14 

(D) OTHER INFORMATION: /label= name
/note= "base gene [(SSP5)2]"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 


Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
1 5 10
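
The relationship between the oligonucleotide of SEQ ID NO:20 and the peptide of SEQ ID NO:22 can be checked with a short translation sketch. The function name `translate` and the choice of reading frame (starting at the first ATG, i.e. the second base) are assumptions made for illustration; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed applicable here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, start: int = 0) -> str:
    """Translate `dna` from `start` in one-letter code, stopping at the first stop codon."""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# SEQ ID NO:20 with the spacing between ten-base groups removed.
SEQ20 = "CATGGAGGAGAAGATGAAGGCGATGGAAGAGAAGATGAAGGCGTGATAGGTACCG"
peptide = translate(SEQ20, start=SEQ20.find("ATG"))
print(peptide)  # prints: MEEKMKAMEEKMKA
```

In one-letter code the result corresponds to the 14 residues listed for SEQ ID NO:22 (Met Glu Glu Lys Met Lys Ala, twice).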


(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 





(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 84"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GATGGAGGAG AAGATGAAGG C 


(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 85"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
ATCGCCTTCA TCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 82"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

GATGGAGGAG AAGCTGAAGG C 21 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 83"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

ATCGCCTTCA GCTTCTCCTC C 21 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Glu Glu Lys Leu Lys Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Glu Glu Lys Met Lys Ala 
1 5

(2) INFORMATION FOR SEQ ID NO:29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C15 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 
/product= "protein" 
/gene= "ssp" 

/standard_name= "5.7.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG  46
  Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met
  1               5                   10                  15

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG  94
Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu
                20                  25                  30

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG  142
Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met
            35                  40                  45

AAG GCG TGATAGGTAC CG  160
Lys Ala
        50

(2) INFORMATION FOR SEQ ID NO:30:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 


Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 


Ala 
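
The /standard_name value "5.7.7.7.7.7.5" given for the coding sequence of SEQ ID NO:29 appears to describe the order of seven-residue segments in the encoded protein of SEQ ID NO:30. The sketch below assumes that "5" denotes the heptad of SEQ ID NO:28 (Met Glu Glu Lys Met Lys Ala) and "7" the heptad of SEQ ID NO:27 (Met Glu Glu Lys Leu Lys Ala); this mapping is inferred from the sequences rather than stated in the listing, and the names `SEGMENTS` and `assemble` are hypothetical.

```python
# Assumed mapping from segment labels to heptads (inferred, not stated in the listing).
SEGMENTS = {
    "5": "Met Glu Glu Lys Met Lys Ala",  # as in SEQ ID NO:28
    "7": "Met Glu Glu Lys Leu Lys Ala",  # as in SEQ ID NO:27
}


def assemble(standard_name: str) -> list:
    """Expand a dotted segment name such as '5.7.5' into a list of residues."""
    residues = []
    for label in standard_name.split("."):
        residues.extend(SEGMENTS[label].split())
    return residues


# Residues of SEQ ID NO:30 as listed, in three-letter code.
SEQ30 = """
Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu
Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys
Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys
Ala
""".split()

built = assemble("5.7.7.7.7.7.5")
print(len(built), built == SEQ30)  # prints: 49 True
```

Under the same assumed mapping, `assemble("5.5")` yields the 14 residues of SEQ ID NO:22.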


(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C20 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.7.7.7.7.7.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met 

35 40 45 

AAG GCG TGATAGGTAC CG 160 

Lys Ala 

50 





(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 

1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 

20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 


Ala 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C30 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..130 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 

/standard_name= "5.7.7.7.7.5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 


AAG CTG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC CG 139
Lys Leu Lys Ala Met Glu Glu Lys Met Lys Ala
35 40


(2) INFORMATION FOR SEQ ID NO:34: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 

1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 

20 25 30 


Leu Lys Ala Met Glu Glu Lys Met Lys Ala
35 40


(2) INFORMATION FOR SEQ ID NO:35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: D16 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.5.5.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met

1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: D20 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..109 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 
/standard_name= 

"5.5.5.5.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG GAA GAG 94

Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu Glu 

20 25 30 

AAG ATG AAG GCG TGATAGGTAC CG 118 

Lys Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys 

20 25 30 

Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO:39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: D33 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp"
/standard_name= 

"5.5.5.5" 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
1 5 10 15

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


CG 


97 


(2) INFORMATION FOR SEQ ID NO:40: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 


Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu
1               5                   10                  15

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala
            20                  25


(2) INFORMATION FOR SEQ ID NO:41: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 86"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 


GATGGAGGAG AAGCTGAAGA A 


21 


(2) INFORMATION FOR SEQ ID NO:42:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 





(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
87" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
ATCTTCTTCA GCTTCTCCTC C 
(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 88"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GATGGAGGAG AAGCTGAAGT G 
(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
89" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

ATCCACTTCA GCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:45:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
90" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
GATGGAGGAG AAGATGAAGA A 
(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 91"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

ATCTTCTTCA TCTTCTCCTC C 
(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 


(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 92"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
GATGGAGGAG AAGATGAAGT G 


(2) INFORMATION FOR SEQ ID NO:48: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic oligonucleotide"
/standard_name= "SM 93"


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 


ATCCACTTCA TCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 


Met Glu Glu Lys Leu Lys Lys 
1 5 


(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 


Met Glu Glu Lys Leu Lys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

Met Glu Glu Lys Met Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Met Glu Glu Lys Met Lys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 82-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 
/gene= "ssp" 
/standard_name= 

"7.7.7.7.7.7.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

C ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met 
35 40 45 

AAG GCG TGATAGGTAC CG 160 

Lys Ala 

50 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 


Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 

Ala 

(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 84-H3 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.5.5.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
1 5 10 15 

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 

Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


CG 97 


(2) INFORMATION FOR SEQ ID NO:56: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 


Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 


(vii) IMMEDIATE SOURCE: 

(B) CLONE: 86-H23 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.8.8.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG AAG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Lys Met 
1 5 10 15 

GAG GAG AAG CTG AAG AAG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 

Glu Glu Lys Leu Lys Lys Met Glu Glu Lys Met Lys Ala 
20 25 


CG 97 

(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Lys Met Glu 
1 5 10 15 

Glu Lys Leu Lys Lys Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 88-2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..103 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.9.9.9.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

C ATG GAG GAG AAG ATG AAG GCG AAG AAG CTG AAG TGG ATG GAG GAG 46 

Met Glu Glu Lys Met Lys Ala Lys Lys Leu Lys Trp Met Glu Glu 

1 5 10 15 

AAG CTG AAG TGG ATG GAG GAG AAG CTG AAG TGG ATG GAA GAG AAG ATG 94 

Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met 

20 25 30 

AAG GCG TGATAGGTAC CG 112 

Lys Ala 

(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Met Glu Glu Lys Met Lys Ala Lys Lys Leu Lys Trp Met Glu Glu Lys 

1 5 10 15 

Leu Lys Trp Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys 

20 25 30 


Ala 

(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 90-H8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..109 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.10.10.10.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG AAG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Lys Met 

1 5 10 15 

GAG GAG AAG ATG AAG AAG ATG GAG GAG AAG ATG AAG AAG ATG GAA GAG 94 

Glu Glu Lys Met Lys Lys Met Glu Glu Lys Met Lys Lys Met Glu Glu 

20 25 30 

AAG ATG AAG GCG TGATAGGTAC CG 118 

Lys Met Lys Ala 

35 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 


Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Lys Met Glu 
1 5 10 15 

Glu Lys Met Lys Lys Met Glu Glu Lys Met Lys Lys Met Glu Glu Lys 
20 25 30 

Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 92-2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.11.11.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG TGG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Trp Met 
1 5 10 15 

GAG GAG AAG ATG AAG TGG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 

Glu Glu Lys Met Lys Trp Met Glu Glu Lys Met Lys Ala 

20 25 


CG 97 


(2) INFORMATION FOR SEQ ID NO:64: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Trp Met Glu 
1 5 10 15 

Glu Lys Met Lys Trp Met Glu Glu Lys Met Lys Ala 
20 25 


(2) INFORMATION FOR SEQ ID NO:65: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
96" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

GATGGAGGAA AAGATGAAGG CGATGGAGGA GAAAATGAAA GCTATGGAGG AAAAGATGAA 60 
AGCGATGGAG GAGAAAATGA AGGC 84 

(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
97" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

ATCGCCTTCA TTTTCTCCTC CATCGCTTTC ATCTTTTCCT CCATAGCTTT CATTTTCTCC 60 
TCCATCGCCT TCATCTTTTC CTCC 84 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /label= name 

/note= "(SSP 5)4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO:68: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
98" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 


GATGGAGGAA AAGCTGAAAG CGATGGAGGA GAAACTCAAG GCTATGGAAG AAAAGCTTAA 60 


AGCGATGGAG GAGAAACTGA AGGC 84 


(2) INFORMATION FOR SEQ ID NO:69: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
99" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

ATCGCCTTCA GTTTCTCCTC CTACGCTTTA AGCTTTTCTT CCATAGCCTT GAGTTTCTCC 60 


TCCATCGCTT TCAGCTTTTC CTCC 84 


(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /label= name 

/note= "(SSP 7)4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala 
20 25 


(2) INFORMATION FOR SEQ ID NO:71: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
100" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 


GATGGAGGAA AAGCTTAAGA AGATGGAAGA AAAGCTGAAA TGGATGGAGG AGAAACTCAA 60 


AAAGATGGAG GAAAAGCTTA AATG 84 


(2) INFORMATION FOR SEQ ID NO:72: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
101" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

ATCCATTTAA GCTTTTCCTC CTACTTTTTG AGTTTCTCCT CCATCCATTT CAGCTTTTCT 60 
TCCATCTTCT TAAGCTTTTC CTCC 84 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Met Glu Glu Lys Leu Lys Lys Met Glu Glu Lys Leu Lys Trp Met Glu 
1 5 10 15 

Glu Lys Leu Lys Lys Met Glu Glu Lys Leu Lys Trp 
20 25 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 2-9 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..235 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"7.7.7.7.7.7.8.9.8.9.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

C ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAA AAG CTT 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu 

35 40 45 

AAG AAG ATG GAA GAA AAG CTG AAA TGG ATG GAG GAG AAA CTC AAA AAG 190 

Lys Lys Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Lys 

50 55 60 

ATG GAG GAA AAG CTT AAA TGG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 242 
Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys Ala 
65 70 75 

C 243 

(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 


Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys 
35 40 45 

Lys Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Lys Met 
50 55 60 

Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys Ala 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 5-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..172 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 
/standard_name= 

"5.5.5.7.7.7.7.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
1 5 10 15 

GAG GAG AAG ATG AAG GCG ATG GAG GAA AAG CTG AAA GCG ATG GAG GAG 94 

Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAA CTC AAG GCT ATG GAA GAA AAG CTT AAA GCG ATG GAG GAG AAA CTG 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu 

35 40 45 

AAG GCC ATG GAA GAG AAG ATG AAG GCG TGATAG 175 

Lys Ala Met Glu Glu Lys Met Lys Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
1 5 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys 
35 40 45 

Ala Met Glu Glu Lys Met Lys Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..173 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"SSP-3-5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

CC ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 47 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 

1 5 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 95 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAA AAG ATG 143 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met 

35 40 45 

AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC CGAATTC 187 

Lys Ala Met Glu Glu Lys Met Lys Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 

1 5 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 

20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 

Ala Met Glu Glu Lys Met Lys Ala 
50 55 


(2) INFORMATION FOR SEQ ID NO:80: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..61 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
107" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 


CATGGAGGAG AAGATGAAAA AGCTCGAAGA GAAGATGAAG GTCATGAAGT GATAGGTACC 60 


G 61 


(2) INFORMATION FOR SEQ ID NO:81: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..61 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 

/standard_name= "SM 
106" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 


AATTCGGTAC CTATCACTTC ATGACCTTCA TCTTCTCTTC GAGCTTTTTC ATCTTCTCCT 60 


C 61 

(2) INFORMATION FOR SEQ ID NO:82: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 


(ii) MOLECULE TYPE: protein 


(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..16 

(D) OTHER INFORMATION: /label= name 

/note= "pSK34 base 
gene" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 


Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys Val Met Lys 
1 5 10 15 


(2) INFORMATION FOR SEQ ID NO:83: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
110" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 


GCTGGAAGAA AAGATGAAGG CTATGGAGGA CAAGATGAAA TGGCTTGAGG AAAAGATGAA 60 


GAA 63 


(2) INFORMATION FOR SEQ ID NO:84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
111" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

AGCTTCTTCA TCTTTTCCTC AAGCCATTTC ATCTTGTCCT CCATAGCCTT CATCTTTTCT 60 


TCC 63 


(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 


Met Glu Glu Lys Met ... 
1 5 10 15 

Asp Lys Met Lys Trp ... 
20 25 30 

Met Lys Val Met Lys 
35 


(2) INFORMATION FOR SEQ ID NO:86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys Ala Met Glu 
15 10 15 

Asp Lys Met Lys Trp Leu Glu Glu Lys Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Lys 
35 

(2) INFORMATION FOR SEQ ID NO:87: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..62 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
112" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 


GCTCGAAGAA AGATGAAGGC AATGGAAGAC AAAATGAAGT GGCTTGAGGA GAAAATGAAG 60 


AA 62 


(2) INFORMATION FOR SEQ ID NO:88: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..62 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
113" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 


AGCTTCTTCA TTTTCTCCTC AAGCCACTTC ATTTTGTCTT CCATTGCCTT CATCTTTCTT 60 


CG 62 


(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 


Met Glu Glu Lys Met Lys Lys Leu Lys Glu Glu Met Ala Lys Met Lys 
1 5 10 15 

Asp Glu Met Trp Lys Leu Lys Glu Glu Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Lys 

(2) INFORMATION FOR SEQ ID NO:90: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 

/standard_name= "SM 
114" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 


GCTCAAGGAG GAAATGGCTA AGATGAAAGA CGAAATCTGG AAACTGAAAG AGGAAATGAA 60 


GAA 63 


(2) INFORMATION FOR SEQ ID NO:91: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
115" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 


AGCTTCTTCA TTTCCTCTTT CAGTTTCCAC ATTTCGTCTT TCATCTTAGC CATTTCCTCC 60 


TTG 63 

(2) INFORMATION FOR SEQ ID NO:92: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

Met Glu Glu Lys Met Lys Lys Leu Lys Glu Glu Met Ala Lys Met Lys 
1 5 10 15 

Asp Glu Met Trp Lys Leu Lys Glu Glu Met Lys Lys Leu Glu Glu Lys 
20 25 30 

Met Lys Val Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys 
35 40 45 

Ala Met Glu Asp Lys Met Lys Trp Leu Glu Glu Lys Met Lys Lys Leu 
50 55 60 

Glu Glu Lys Met Lys Val Met Glu Glu Lys Met Lys Lys Leu Glu Glu 
65 70 75 80 

Lys Met Lys Ala Met Glu Asp Lys Met Lys Trp Leu Glu Glu Lys Met 
85 90 95 

Lys Lys Leu Glu Glu Lys Met Lys Val Met Lys 
100 105 

(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GGATCCCCCG GGCTGCAGGA ATTCTACGTA CCATATAGTA AGACTTTGTA TATAAGACGT 60 

CACCTCTTAC GTGCATGGTT ATATGTGACA TGTGCAGTGA CGTTGTACCA TATAGTAAGA 120 

CTTTGTATAT AAGACGTCAC CTCTTACGTG CATGGTTATA TGTGACATGT GCAGTGACGT 180 

TAACCGCACC CTCCTTCCCG TCGTTTCCCA TCTCTTCCTC CTTTAGAGCT ACCACTATAT 240 

AAATCAGGGC TCATTTTCTC GCTCCTCACA GGCTCATCAG CACCCCGGCA GTGCCACCCC 300 

GACTCCCTGC ACCTGCCATG GGTACGCTAG CCCGGGAGAT CTGACAAAGC AGCATTAGTC 360 

CGTTGATCGG TGGAAGACCA CTCGTCAGTG TTGAGTTGAA TGTTTGATCA ATAAAATACG 420 

GCAATGCTGT AAGGGTTGTT TTTTATGCCA TTGATAATAC ACTGTACTGT TCAGTTGTTG 480 
AACTCTATTT CTTAGCCATG CCAGTGCTTT TCTTATTTTG AATAACATTA CAGCAAAAAG 540 
TTGAAAGACA AAAAAANNNN NCCCCGAACA GAGTGCTTTG GGTCCCAAGC TTCTTTAGAC 600 
TGTGTTCGGC GTTCCCCCTA AATTTCTCCC CTATATCTCA CTCACTTGTC ACATCAGCGT 660 
TCTCTTTCCC CTATATCTCC ACGCTCTACA GCAGTTCCAC CTATATCAAA CCTCTATACC 720 
CCACCACAAC AATATTATAT ACTTTCATCT TCACCTAACT CATGTACCTT CCAATTTTTT 780 
TCTACTAATA ATTATTTACG TGCACAGAAA CTTAGGCAAG GGAGAGAGAG AGCGGTACC 839 
(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

CTAGAAGCCT CGGCAACGTC AGCAACGGCG GAAGAATCCG GTG 43 

(2) INFORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 

CATGCACCGG ATTCTTCCGC CGTTGCTGAC GTTGCCGAGG CTT 43 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 

GATCCCATGG CGCCCCTTAA GTCCACCGCC AGCCTCCCCG TCGCCCGCCG CTCCT 55 

(2) INFORMATION FOR SEQ ID NO:97: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 

CTAGAGGAGC GGCGGGCGAC GGGGAGGCTG GCGGTGGACT TAAGGGGCGC CATGG 55 
(2) INFORMATION FOR SEQ ID NO:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

CATGGCGCCC ACCGTGATGA TGGCCTCGTC GGCCACCGCC GTCGCTCCGT TCCAGGGGC 59 
(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 

TTAAGCCCCT GGAACGGAGC GACGGCGGTG GCCGACGAGG CCATCATCAC GGTGGGCGC 59 
(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 

GCGCCCACCG TGATGA 16 

(2) INFORMATION FOR SEQ ID NO:101: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101: 

CACCGGATTC TTCCGC 16 

(2) INFORMATION FOR SEQ ID NO:102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 

GTAAGATTGG TAAAGTCCAG CAAGAAAATG AGATAAAAGA GAAGCCTGAA ATGACGAAAA 60 

AATCAGGTGT TTTGATTCTT GGTGCTGGAC GTGTGTNTCG CCCAGCTGCT GATTTCCTAG 120 

CTTCAGTTAG AACCATTTCG TCACAGCAAT GGTACAAAAC ATATTTCGGA GCAGACTCTG 180 

AAGAGAAAAC AGATGTTCAT GTGATTGTCG CGTCTCTGTA TCTTAAGGAT GCCAAAGAGA 240 

CGGTTGAAGG TATTTCAGAT GTAGAAGCAG TTCGGCTAGA TGTATCTGAT AGTGAAAGTC 300 

TCCTTAAGTA TGTTTCTCAG GTTGATGTTG TCCTAAGTTT ATTACCTGCA AGTTGTCATG 360 

CTTGTTGTAG CA 372 

(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 


GGAAGCACAC TGCGACTCTT TTGGAATTCG GGGACATCAA GAATGGACAA ACAACAACCG 60 

CTATGGCCAA GACTGTTGGG ATCCCTGCAG CCATTGGAGC TCTGCTGTTA ATTGAAGACA 120 

AGATCAAGAC AAGAGGAGTC TTAAGGCCTC TCGAAGCAGA GGTGTATTTG CCAGCTTTGG 180 

ATATATTGCA AGCATATGGT ATAAAGCTGA TGGAGAAGGC AGAATGATCA AAGAACTCTG 240 

TATATTGTTT CTNTCTATAA CTTGGAGTTG GAGACAAAGC TGAAGGAGNC AGNGCCATTA 300 

GACCAGCAAA AAAAGGAGGA GGA 323 


(2) INFORMATION FOR SEQ ID NO:104: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

Lys Ile Gly Lys Val Gln Gln Glu Asn Glu Ile Lys Glu Lys Pro Glu 

1 5 10 15 

Met Thr Lys Lys Ser Gly Val Leu Ile Leu Gly Ala Gly Arg Val Xaa 

20 25 30 

Arg Pro Ala Ala Asp Phe Leu Ala Ser Val Arg Thr Ile Ser Ser Gln 
35 40 45 

Gln Trp Tyr Lys Thr Tyr Phe Gly Ala Asp Ser Glu Glu Lys Thr Asp 
50 55 60 

Val His Val Ile Val Ala Ser Leu Tyr Leu Lys Asp Ala Lys Glu Thr 
65 70 75 80 

Val Glu Gly Ile Ser Asp Val Glu Ala Val Arg Leu Asp Val Ser Asp 
85 90 95 

Ser Glu Ser Leu Leu Lys Tyr Val Ser Gln Val Asp Val Val Leu Ser 
100 105 110 

Leu Leu Pro Ala Ser Cys His Ala Cys Cys Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO:105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: 

Lys His Thr Ala Thr Leu Leu Glu Phe Gly Asp Ile Lys Asn Gly Gln 
1 5 10 15 

Thr Thr Thr Ala Met Ala Lys Thr Val Gly Ile Pro Ala Ala Ile Gly 
20 25 30 

Ala Leu Leu Leu Ile Glu Asp Lys Ile Lys Thr Arg Gly Val Leu Arg 
35 40 45 

Pro Leu Glu Ala Glu Val Tyr Leu Pro Ala Leu Asp Ile Leu Gln Ala 
50 55 60 

Tyr Gly Ile Lys Leu Met Glu Lys Ala Glu 
65 70 








(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 
ATTCCCCATG GTTTCGCCGA CGAAT 25
(2) INFORMATION FOR SEQ ID NO:107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
CTCTCGGTAC CTAGTACCTA CTGATCAAC 29
(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 
AGAGAAGCCT GAAATGACGA AAAA 24

(2) INFORMATION FOR SEQ ID NO:109:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: 

GTCTTGGCCA TAGCGGTTGT TGTT 24 

(2) INFORMATION FOR SEQ ID NO:110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: 


TCTAGATGCA CATTCAACTC GAGGTTGTTG CATGATGTTT CATTTACCAA AAAAATCATA 60
GTCAAATTAT GTAAGCAAAT GATATTACAG AAAAGTTTTA CTAGAGAGTT TCAGATTTAC 120
ACATGCACAA CGTTAAAAAA AATAGCAGAA AAAAGAAAGA AGAAAAGTTC TTTATTTGTG 180
AGAAAAATGT ATGAAAAAAA AAGAGATGGG TGTAAAAAGC AAAAGGATAG GACCACTGTT 240
ACTTTGTAGC CTCGTTGAGG AATCTCTTCT CGCATCTCGA CTTTTGTGCC ATTGCAAAGT 300
CAATGCCCAG AACTTGTTCC CAGGCCATCT CCAATTAACT ACGTCTATTT AATTAAACTT 360
TTAAAAGAAA ACCTAATAAA TTAAACAAAA GAAAAGCCGT CAACGAAATC TAAGCTTGCA 420
GCGATATCGA TGAACTGATA CCAAAACAAT GTTCAAGTTT CACTTTCAAA TTGTTTTTTC 480
TTGAAATAGT TTATTGGGTA AGGCCCATAG ATATTTCATA AGAAGAACAC TTGTCGAGGT 540
TGAATCGTAT GTCTGCCCAC CGCGGCCCAT GCATCCTCTG TTGGTAGCAT AATCGTTTTA 600
GGCCATACTA TTGTTCGTAC ACACTGATTT TGAAGTCACC TTTGTGCACT CCTTAATTCC 660
TAAATTGAAG AAGCTTGTTC TCATTCTTCT TTGGGTTACA AATGCCAAGG CAAAAGGAAC 720
TTGGGCCAAA TTAAGACAAC AACTCAAGCC CACTCTCTGC AAATAATACT TGGGAATTTT 780
TACTAAAACG GTGCGTTTCA TCCAAGAATC TATTAATATC CCTAACTTGA AATCATCATA 840
TACGTAACCC AACATATTAA AGAGTTAATA ATGTTAAAAA AAGTCTCAGA AGAGAGAGAC 900
GTAGAGAACA CGGAAAGTGG TAACTGGTAA GCGTCGTCAT CGAGGATATA GTAGCTACGT 960
GAGCAAACGT CTTCACTCAT CTCTGTCTAT TTCTCTTCGA ATACACGTAA TACATTTTCG 1020
ATTGGATTGA TCCTCCCTCG GTCCTATCCA AGTATCCATC CACGTAAACA AGAGCTTGTT 1080
CCTTTCTTGT TTTTTCTTTC TTTAAATAGT AAAAATACTT ATTTCATTTG TTTCGTTTGA 1140
TTTCATTATT ATTGTCTATG GCATTATATA CTATATATAT TATTTCTACA ACATTGGCTG 1200
GCTCACGTTG TTCTCGTGTA TACAACAAAC TTAATTAATG TCTCTCTATT GCATTAGATA 1260
GTTTCGGAGC ATATCCATTA TGTGAAAGCC ACATTAAGTT ATAACTAAAA GTAGTTTTCG 1320
AAAGAGCTTA ATTAAGTTAT GTTCTGTTTC AAATAAAAAT GAACACGAGG GATTTTTTTT 1380
TTTTTTGACA GATCATTATT AACAAAAATG ATTACCTGAA GAAAGGGGAA AATAATTATA 1440
GCTGATTACA GATCATTATT AACAAAAAGA ATTCTTGTCA CATCATTCAT TATAACAAGA 1500
AATATTATAT TATATTAATT TAATCTTTCG CTAACACGCC CACAATATAT TAATCATATA 1560
CGTAATTTAG CTTATAAAAA GGACGGAAAG AGATTATTAC TGCGCCTAAA AAACTCACTA 1620
ATTCCAAAGA AAAAAAAAAG CTTGTATTTT TTCTTGACAA ACCAGCTCAC AGGCATTGCA 1680
TGATCAAACT CATCAGGTAC GTTTTGATTC CTTCTTCCAT AATTTTCCCA TCTTGAGGAA 1740
TGCAAATTTG GAGAGCGCTT TAGCTAAATC ACTGCCTTCA TTTTTTCACT TTGGATTTAA 1800
TAATTTGCAT TCCTCTCTTC CTCTCTGCTC TGTTCTGTTC TGTTCTGTTC TGATTTGAGT 1860
TTTCAATTAA TCGCTCGAGC AAAAGCTATT TCTCAACTCG TTAAATTTCT GTTCCCAGTT 1920
TGTTCGATTT TCAACAGTTT CACATTAAAG TTTGGGTTTT TGATGTTTGG TTGATGAAAC 1980
TCGAAATATG AAATGTTTGT GAATCTATTC CAGGGTGTTT AAAATAAGGG TTTGTTGTTC 2040
ATCTGCAGAG ATTATATGTT TTTACATGAA AGATGAATTC AAATGGCCAT GAGGAGGAGA 2100
AGAAGTTGGG GAATGGAGTT GTGGGGATTC TAGCTGAAAC AGTTAACAAA TGGGAGAGAC 2160
GAACACCATT GACGCCATCG CATTGCGCTC GCCTTTTACA CGGTGGGAAA GACAGAACCG 2220
GCATTTCCCG CATTGTGGTT CAGCCATCTG CTAAGCGTAT CCATCATGAT GCCTTGTATG 2280
AAGATGTTGG GTGTGAAATT TCTGATGATT TGTCTGATTG TGGGCTTATA CTTGGAATCA 2340
AACAACCTGA GGTGTGGGAA TTTGCATTAA AAAGAGTTCC TTTTTTTCTT CTATATATAT 2400
ATCAGTTTAT GAGATTTGAT TCTGTTTGCA GCTAGAAATG ATTCTTCCAG AGAGAGCATA 2460
CGCTTTCTTT TCACATACTC ATAAGGCACA GAAAGAGAAC ATGCCTTTGT TGGATAAAGT 2520
ATTACACTTT TCATTTATCC TTTTAGTCCT ATCTAAGATA CTGAGGAATG TTGACAAAAG 2580
GGGTATCCAA TTGCAGATTC TTTCTGAGAG AGTGACTTTG TGTGATTATG AGCTCATTGT 2640
TGGGGATCAT GGGAAACGAT TATTGGCGTT TGGTAAATAT GCAGGCAGAG CTGGTCTTGT 2700
TGACTTCTTA CACGGACTTG GACAGCGTAA GCTCATGTTA TAATTCTGAT GATCAGGACA 2760
TGTTTCTGTG CAGAACAAGA TGAGATGTAA TTTTCCATGT TTGATGCAGG ATATCTAAGT 2820 
CTAGGATACT CAACACCTTT CCTCTCGCTC GGTGCATCGT ATATGTATTC CTCATTGGCT 2880 
GCTGCAAAAG CCGCTGTAAT TTCTGTTGGT GAAGAAATTG CAAGCCAGGG ACTGCCATTA 2940 
GGAATCTGCC CTCTTGTATT TGTCTTCACC GGAACAGGAA ATGGTATCTT CTTTAGTTCT 3000 
ACTGCGAGTT CTTTGAATCC TTCTGCATAT GTTTCATCTC ATTAAAAAAT TTCTCATCCG 3060 
CAGTTTCTCT GGGGGCGCAA GAAATTTTCA AGCTTCTTCC TCACACTTTT GTTGAACCAA 3120 
GCAAACTTCC TGAACTATTT GTAAAAGTAA GTCACGCTTT GCTTTTTATT TGGTTTCAGA 3180 
GTTTTGAAGA TTCTGAAATG TATATTTCTC ACAGGACAAA GGAATTAGTC AAAATGGGAT 3240 
TTCAACAAAG CGAGTCTATC AAGTATATGG TTGTATTATT ACCAGCCAAG ACATGGTTGA 3300 
ACACAAAGAT CCATCAAAGT CATTCGACAA AGTAACACTT ACCTTCTTAG CTCCTTGGCT 3360 
GTGACTTTTG TTCCACTACG CTAAAGTAGA ATACCTATTA ATTCTTCAAG CTTATGATGT 3420 
TTAGGCCGAC TATTATGCAC ACCCGGAACA TTACAATCCA GTTTTCCACG AAAAGATATC 3480 
GCCATATACG TCTGTTCTTG GTAGATCCTG ATCACTGTTT TACCTTTAAA GCTCAAGAGT 3540 
TTACATATAA GCAAATCCTC TGTCCACTCC GTGACTGTGA CCATCTCATT TTGGTTAGTT 3600 
CCAGTGTGTA ACCCCTATGA CTTTCTGTGC AGTAAACTGT ATGTACTGGG AGAAGAGGTT 3660 

TCCCTGTCTT CTGAGCACAA AACAGCTTCA AGATTTAACA AAAAAAGGAC TCCCACTAGT 3720 

AGGCATATGT GATATAACTT GTGACATCGG TGGCTCCATT GAATTTGTTA ACCGAGCTAC 3780 

TTTAATCGAT TCCCCTTTCT TCAGGTAATA TATACTTAGG AAGAGCTTTC TTTTGAGTCA 3840 

TCTACGTTTA CTATGATGAA ACTCGTCGAG CTAAACACTA TCTCTAGGTT TAATCCCTCG 3900 

AACAATTCAT ACTACGATGA CATGGATGGG GATGGCGTAC TATGCATGGC TGTTGACATT 3960 
TTACCCACAG AATTTGCAAA AGAGGTATGT ATGAAGGTTA CAGTTATAGT ACTTAAGATT 4020 
AAATCTAAAG TTAAAAACCT TGTATTGAGT GGGAGTTCTT GTGTCCTGAA AAAGGCATCC 4080 
CAGCATTTTG GAGATATTCT TTCCGGATTT GTCGGTAGTT TGGCTTCAAT GACTGAAATT 4140 
TCAGATCTAC CAGCACATCT GAAGAGGGCT TGCATAAGCT ATAGGGGAGA ATTGACATCT 4200 
TTGTATGAGT ATATTCCACG TATGAGGAAG TCAAATCCAG AGTATGTTCT GCTTCGAGCG 4260 
TTACTTCATC TGAAATATTT AGGCCTCTTC TCTAAACTAT GTTTTCATCT TTACCCACTT 4320 
TAACTGCAGA GAGGCACAAG ATAATATTAT CGCCAACGGG GTTTCCAGCC AGAGAACATT 4380 
CAACATATTG GTTAGTTTTG ATGAAGAAAG TATATATAAC TAGTTTCCGA ATCATATGAT 4440 
TTAAGCTAAT GAATTAAGAA AATATATAGT TCAAGACTTA TGATTCATAT CTCTATCAAC 4500
TTTTTGACCA AAGATTGATA CTTTTTCGAC ATCTGTCACA GCATTTTGTG ATGATTTTGA 4560
TTGAGACAAA TCATTTGTAG GTATCTCTGA GCGGACACCT ATTTGATAAG TTTCTGATAA 4620
ACGAAGCTCT TGATATGATC GAAGCGGCTG GTGGCTCATT TCATTTGGCT AAATGTGAAC 4680
TGGGGCAGAG CGCTGATGCT GAATCGTACT CAGAACTTGA AGTAAGTTTC TTTCTGGATA 4740
AAACCTAATC ATTCACATGG AACAACTGTC AAGAGTTTTT AATGTCACGT TTAGGTTCAA 4800
TGTCCTTTTC ACTAAGTCTC GTAAGTTTTT AAAACAAGTA AACAAACTAC AAGCCAAAAA 4860
CATTCTGGCC CCACATTAAC CTATTCCCAC TTGTTAAAGA ACCCATCTTG CATTATCTTG 4920
GTAGGTTGGT GCGGATGATA AGAGAGTATT GGATCAAATC ATTGATTCAT TAACTCGGTT 4980
AGCTAATCCA AATGAAGATT ATATATCCCC ACATAGAGAA GCAAATAAGA TCTCACTGAA 5040
GATTGGTAAA GTCCAGCAAG AAAATGAGAT AAAAGAGAAG CCTGAAATGA CGAAAAAATC 5100
AGGCGTTTTG ATTCTTGGTG CTGGACGTGT GTGTCGCCCA GCTGCTGATT TCCTAGCTTC 5160
AGTTAGAACC ATTTCGTCAC AGCAATGGTA CAAAACATAT TTCGGAGCAG ACTCTGAAGA 5220
GAAAACAGAT GTTCATGTGA TTGTCGCGTC TCTGTATCTT AAGGATGCCA AAGAGGTAGG 5280
AGAAGCCTTT GGGCTTCATC TGAGTAATTC AGTGTATACG ATGAACTATC AATCTTTTAA 5340
AGTTTTACTG ATGATCAAAT TTTCCGCAGA CGGTTGAAGG TATTTCAGAT GTAGAAGCAG 5400
TTCGGCTAGA TGTATCTGAT AGTGAAAGTC TCCTTAAGTA TGTTTCTCAG GTATTTTCCT 5460
AACTTCTCTG TTCTTAGATC ACCTTTACTT CAAACTCCAC TGTTCAAATC CATGATCTTA 5520
TATTTTTTTT TCATTGCACG CAGGTTGATG TTGTCCTAAG TTTATTACCT GCAAGTTGTC 5580
ATGCTGTTGT AGCAAAGACA TGCATTGAGG TAAATTCCTA ACGTTTAATG CGTTTTCCGA 5640
GTGAAGTTAT GAAATTTGCA AATGTTATTC GACATAGAGG TTAAACTTCC TCTGCATAAC 5700


ACATTCTTTC AGTAGTTTCC GGTTCCTAAA TGTCTCTGTT TCTTCTTTCT GATTCACTCA 5760 
GCTGAAGAAG CATCTCGTCA CTGCTAGCTA TGTTGATGAT GAAACGTCCA TGTTACATGA 5820 
GAAGGCTAAG AGTGCTGGGA TAACGATTCT AGGCGAAATG GGACTGGACC CTGGAATCGG 5880 
TATGATATCT CACAACATAG TATCTCTTAA GATCATTTGT TCACTTGATT TAACTTAAGT 5940 
GCATTTATCT TCAAAATATT TCCCGGATAA CTGAGAAGGT GATCCTACAA TGAATCTTTC 6000 
AGATCACATG ATGGCGATGA AAATGATCAA CGATGCTCAT ATCAAAAAAG GGAAAGTGAA 6060 
GTCTTTTACC TCTTATTGTG GAGGGCTTCC CTCTCCTGCT GCAGCAAATA ATCCATTAGC 6120 
ATATAAATTT AGGTACGGTA GTCCTTTACG CCATTAACAT ATTTTGTTTT GTTTAACTCA 6180 
TTTAGACATC CTTTCAGAAT TTCGCTTACT CAATTACATC TCGGTATTTT CAGCTGGAAC 6240
CCTGCTGGAG CAATTCGAGC TGGTCAAAAC CCCGCCAAAT ACAAAAGCAA CGGCGACATA 6300 
ATACATGTTG ATGGTATGAA AAACAAAATA TGTCTACATG CAGGAGAGGT TGGAGTAGTT 6360 
TAGCTTCACT ACACATCATT TTTGTTTAAC CGAGCAATGT AAATCGCAGG GAAGAATCTC 6420 
TATGATTCCG CGGCAAGATT CCGAGTACCT AATCTTCCAG CTTTTGCATT GGAGTGTCTT 6480 
CCAAATCGTG ACTCCTTGGT TTACGGGGAA CATTATGGCA TCGAGAGCGA AGCAACAACG 6540 
ATATTTCGTG GAACACTCAG ATATGAAGGC ATGAATTCCA TAATCACAAC TCACGACTCA 6600 
CTTCTCCATA TCTGAAGGCT TAACACTTGT TTTCTTTTGG CTTGTACAGG GTTTAGTATG 6660 
ATAATGGCAA CACTTTCGAA ACTTGGATTC TTTGACAGTG AAGCAAATCA AGTACTCTCC 6720 
ACTGGAAAGA GGATTACGTT TGGTGCTCTT TTAAGTAACA TTCTAAATAA GGATGCCGAC 6780 
AATGAATCAG AGCCCCTAGC GGGAGAAGAA GAGATAAGCA AGAGAATTAT CAAGCTTGGA 6840 
CATTCCAAGG AGACTGCAGC CAAAGCTGCC AAAACAATTG TGTAAGCTTC TCCATGAAGA 6900 
TATATAATCT GAATGTTGCA GTGTGATTCC AATTCTTCTA CGAAACTCCT AACCCCAATT 6960 
CTTTTGTGGT GTCTTAGATT CTTGGGGTTC AACGAAGAGA GGGAGGTTCC ATCACTGTGT 7020 
AAAAGCGTAT TTGATGCAAC TTGTTACCTA ATGGAAGAGA AACTAGCTTA TTCCGGAAAT 7080 
GAACAGGTCT CTGTTTCATG TGAAAGCATT AGTTTTCTTC TCTCACTTGT ATTTGGTGTT 7140 
ACTTACTGAC ATAAACTTTG GACAATCTTT TGCATTATGT TTTCAGGACA TGGTGCTTTT 7200 
GCATCACGAA GTAGAAGTGG AATTCCTTGA AAGCAAACGT ATAGAGAAGC ACACTGCGAC 7260 
TCTTTTGGAA TTCGGGGACA TCAAGAATGG GCAAACAACA ACCGCTATGG CCAAGACTGT 7320 
TGGGATCCCT GCAGCCATTG GAGCTCTGGT CCTTACTAAG ACTTTGATCA CCACTTTTTC 7380 
CTGTCTATAT TTCTCTAAAA TGAAAGTTTT AAGCGTTTGT TTTATGATGT TGTGTGTTGC 7440 
AGCTGTTAAT TGAAGACAAG ATCAAGACAA GAGGAGTCTT AAGGCCTTTC GAAGCAGAGG 7500 
TGTATTTGCC AGGTAAATTA GAATTCCGCT TCAAAAGGAT GTGTGTTGCA GATAAAGACA 7560
ATGATGTTGA TTTGTTGTGT GTTTGGGATA TGTGGTGTTA TACATACAGC TTTGGATATA 7620 
TTGCAAGCAT ATGGTATAAA GCTGATGGAG AAGGCAGAAT GATCAAAGAA CTCTGTATAT 7680 
TGTTTCTCTC TATAACTTGG AGTTGGAGAC AAAGCTGAAG AAGACAGAGA CATTAGACCA 7740 
GCAAAAAAAG AAGAAGAAGG AAGAAGATAA GCCTCGATCC TTGGGTGACG AGTATCTATA 7800 
TGTTTATATG TACTATATGT TATGTTGTAC AGAAGAAGTC GTGTCCACAA ATATCAATTG 7860 
ATGTCAGATG TCTAGTAAGT GATCATGTGT AGCATACAAA CTGGAGTAAT TTAAAAAGTG 7920 
AATAAACAAA AATAATTACT AAACGTTATT CCAAGTAGCT TTCCAAGACA GTCACTTGCC 7980
CTTTTCCAAT TTCCCTTGCA ATTAACTAAA TTGCTCTTCA CGATATGATA TTATACCAAA 8040 
ATGGTGATAC CTTGGGAATT GTTAATTTGA CTCATTTGAA CAAATCTCAT CTATAAAATC 8100 
ATCCCACCTC TCCACCACAT TTGTTCTCAC TACCAATCAA AAAATAATCT AGTCTTAAAC 8160 
(2) INFORMATION FOR SEQ ID NO:111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:111: 

ATGAATTCAA ATGGCCATGA GGAGGAGAAG AAGTTGGGGA ATGGAGTTGT GGGGATTCTA 60 

TCTGAAACAG TTAACAAATG GGAGAGACGA ACACCATTGA CGCCATCGCA TTGCGCTCGC 120

CTTTTACACG GTGGGAAAGA CAGAACCGGC ATTTCCCGCA TTGTGGTTCA GCCATCTGCT 180 

AAGCGTATCC ATCATGATGC CTTGTATGAA CATGTTGGGT GTGAAATTTC TGATGATTTG 240 

TCTGATTGTG GGCTTATACT TGGAATCAAA CAACCTGAGC TAGAAATGAT TCTTCCAGAG 300 

AGAGCATACG CTTTCTTTTC ACATACTCAT AAGGCACAGA AAGAGAACAT GCCTTTGTTG 360 

GATAAAATTC TTTCTGAGAG AGTGACTTTG TGTGATTATG AGCTCATTGT TGGGGATCAT 420 

GGGAAACGAT TATTGGCGTT TGGTAAATAT GCAGGCAGAG CTGGTCTTGT TGACTTCTTA 480 

CACGGACTTG GACAGCGATA TCTAAGTCTA GGATACTCAA CACCTTTCCT CTCGCTCGGT 540 

GCATCGTATA TGTATTCCTC ATTGGCTGCT GCAAAAGCCG CTGTAATTTC TGTTGGTGAA 600 

GAAATTGCAA GCCAGGGACT GCCATTAGGA ATCTGCCCTC TTGTATTTGT CTTCACCGGA 660 

ACAGGAAATG TTTCTCTGGG GGCGCAAGAA ATTTTCAAGC TTCTTCCTCA CACTTTTGTT 720 

GAACCAAGCA AACTTCCTGA ACTATTTGTA AAAGACAAAG GAATTAGTCA AAATGGGATT 780 

TCAACAAAGC GAGTCTATCA AGTATATGGT TGTATTATTA CCAGCCAAGA CATGGTTGAA 840 

CACAAAGATC CATCAAAGTC ATTCGACAAA GCCGACTATT ATGCACACCC GGAACATTAC 900 

AATCCAGTTT TCCACGAAAA GATATCGCCA TATACGTCTG TTCTTGTAAA CTGTATGTAC 960 

TGGGAGAAGA GGTTTCCCTG TCTTCTGAGC ACAAAACAGC TTCAAGATTT AACAAAAAAA 1020 

GGACTCCCAC TAGTAGGCAT ATGTGATATA ACTTGTGACA TCGGTGGCTC CATTGAATTT 1080 

GTTAACCGAG CTACTTTAAT CGATTCCCCT TTCTTCAGGT TTAATCCCTC GAACAATTCA 1140 
TACTACGATG ACATGGATGG GGATGGCGTA CTATGCATGG CTGTTGACAT TTTACCCACA 1200
GAATTTGCAA AAGAGGCATC CCAGCATTTT GGAGATATTC TTTCCGGATT TGTCGGTAGT 1260 
TTGGCTTCAA TGACTGAAAT TTCAGATCTA CCAGCACATC TGAAGAGGGC TTGCATAAGC 1320 
TATAGGGGAG AATTGACATC TTTGTATGAG TATATTCCAC GTATGAGGAA GTCAAATCCA 1380 
GAAGAGGCAC AAGATAATAT TATCGCCAAC GGGGTTTCCA GCCAGAGAAC ATTCAACATA 1440 
TTGGTATCTC TGAGCGGACA CCTATTTGAT AAGTTTCTGA TAAACGAAGC TCTTGATATG 1500 
ATCGAAGCGG CTGGTGGCTC ATTTCATTTG GCTAAATGTG AACTGGGGCA GAGCGCTGAT 1560 
GCTGAATCGT ACTCAGAACT TGAAGTTGGT GCGGATGATA AGAGAGTATT GGATCAAATC 1620 
ATTGATTCAT TAACTCGGTT AGCTAATCCA AATGAAGATT ATATATCCCC ACATAGAGAA 1680 
GCAAATAAGA TCTCACTGAA GATTGGTAAA GTCCAGCAAG AAAATGAGAT AAAAGAGAAG 1740 
CCTGAAATGA CGAAAAAATC AGGTGTTTTG ATTCTTGGTG CTGGACGTGT GTGTCGCCCA 1800 
GCTGCTGATT TCCTAGCTTC AGTTAGAACC ATTTCGTCAC AGCAATGGTA CAAAACATAT 1860 
TTCGGAGCAG ACTCTGAAGA GAAAACAGAT GTTCATGTGA TTGTCGCGTC TCTGTATCTT 1920 
AAGGATGCCA AAGAGACGGT TGAAGGTATT TCAGATGTAG AAGCAGTTCG GCTAGATGTA 1980 
TCTGATAGTG AAAGTCTCCT TAAGTATGTT TCTCAGGTTG ATGTTGTCCT AAGTTTATTA 2040 
CCTGCAAGTT GTCATGCTGT TGTAGCAAAG ACATGCATTG AGCTGAAGAA GCATCTCGTC 2100 
ACTGCTAGCT ATGTTGATGA TGAAACGTCC ATGTTACATG AGAAGGCTAA GAGTGCTGGG 2160 
ATAACGATTC TAGGCGAAAT GGGACTGGAC CCTGGAATCG ATCACATGAT GGCGATGAAA 2220 
ATGATCAACG ATGCTCATAT CAAAAAAGGG AAAGTGAAGT CTTTTACCTC TTATTGTGGA 2280 
GGGCTTCCCT CTCCTGCTGC AGCAAATAAT CCATTAGCAT ATAAATTTAG CTGGAACCCT 2340 

GCTGGAGCAA TTCGAGCTGG TCAAAACCCC GCCAAATACA AAAGCAACGG CGACATAATA 2400 

CATGTTGATG GGAAGAATCT CTATGATTCC GCGGCAAGAT TCCGAGTACC TAATCTTCCA 2460 

GCTTTTGCAT TGGAGTGTTT TCCAAATCGT GACTCCTTGG TTTACGGGGA ACATTATGGC 2520 

ATCGAGAGCG AAGCAACAAC GATATTTCGT GGAACACTCA GATATGAAGG GTTTAGTATG 2580 

ATAATGGCAA CACTTTCGAA ACTTGGATTC TTTGACAGTG AAGCAAATCA AGTACTCTCC 2640 

ACTGGAAAGA GGATTACGTT TGGTGCTCTT TTAAGTAACA TTCTAAATAA GGATGCAGAC 2700 

AATGAATCAG AGCCCCTAGC GGGAGAAGAA GAGATAAGCA AGAGAATTAT CAAGCTTGGA 2760 

CATTCCAAGG AGACTGCAGC CAAAGCTGCC AAAACAATTG TATTCTTGGG GTTCAACGAA 2820 

GAGAGGGAGG TTCCATCACT GTGTAAAAGC GTATTTGATG CAACTTGTTA CCTAATGGAA 2880 
GAGAAACTAG CTTATTCCGG AAATGAACAG GACATGGTGC TTTTGCATCA CGAAGTAGAA 2940
GTGGAATTCC TTGAAAGCAA ACGTATAGAG AAGCACACTG CGACTCTTTT GGAATTCGGG 3000
GACATCAAGA ATGGACAAAC AACAACCGCT ATGGCCAAGA CTGTTGGGAT CCCTGCAGCC 3060
ATTGGAGCTC TGGTGTTAAT TGAAGACAAG ATCAAGACAA GAGGAGTCTT AAGGCCTCTC 3120
GAAGCAGAGG TGTATTTGCC AGCTTTGGAT ATATTGCAAG CATATGGTAT AAAGCTGATG 3180
GAGAAGGCAG AATGA 3195

(2) INFORMATION FOR SEQ ID NO:112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1064 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:

Met Asn Ser Asn Gly His Glu Glu Glu Lys Lys Leu Gly Asn Gly Val
1 5 10 15

Val Gly Ile Leu Ser Glu Thr Val Asn Lys Trp Glu Arg Arg Thr Pro
20 25 30

Leu Thr Pro Ser His Cys Ala Arg Leu Leu His Gly Gly Lys Asp Arg
35 40 45

Thr Gly Ile Ser Arg Ile Val Val Gln Pro Ser Ala Lys Arg Ile His
50 55 60

His Asp Ala Leu Tyr Glu His Val Gly Cys Glu Ile Ser Asp Asp Leu
65 70 75 80

Ser Asp Cys Gly Leu Ile Leu Gly Ile Lys Gln Pro Glu Leu Glu Met
85 90 95

Ile Leu Pro Glu Arg Ala Tyr Ala Phe Phe Ser His Thr His Lys Ala
100 105 110

Gln Lys Glu Asn Met Pro Leu Leu Asp Lys Ile Leu Ser Glu Arg Val
115 120 125

Thr Leu Cys Asp Tyr Glu Leu Ile Val Gly Asp His Gly Lys Arg Leu
130 135 140

Leu Ala Phe Gly Lys Tyr Ala Gly Arg Ala Gly Leu Val Asp Phe Leu
145 150 155 160

His Gly Leu Gly Gln Arg Tyr Leu Ser Leu Gly Tyr Ser Thr Pro Phe
165 170 175

Leu Ser Leu Gly Ala Ser Tyr Met Tyr Ser Ser Leu Ala Ala Ala Lys
180 185 190

Ala Ala Val Ile Ser Val Gly Glu Glu Ile Ala Ser Gln Gly Leu Pro
195 200 205

Leu Gly Ile Cys Pro Leu Val Phe Val Phe Thr Gly Thr Gly Asn Val
210 215 220

Ser Leu Gly Ala Gln Glu Ile Phe Lys Leu Leu Pro His Thr Phe Val
225 230 235 240

Glu Pro Ser Lys Leu Pro Glu Leu Phe Val Lys Asp Lys Gly Ile Ser
245 250 255

Gln Asn Gly Ile Ser Thr Lys Arg Val Tyr Gln Val Tyr Gly Cys Ile
260 265 270

Ile Thr Ser Gln Asp Met Val Glu His Lys Asp Pro Ser Lys Ser Phe
275 280 285

Asp Lys Ala Asp Tyr Tyr Ala His Pro Glu His Tyr Asn Pro Val Phe
290 295 300

His Glu Lys Ile Ser Pro Tyr Thr Ser Val Leu Val Asn Cys Met Tyr
305 310 315 320

Trp Glu Lys Arg Phe Pro Cys Leu Leu Ser Thr Lys Gln Leu Gln Asp
325 330 335

Leu Thr Lys Lys Gly Leu Pro Leu Val Gly Ile Cys Asp Ile Thr Cys
340 345 350

Asp Ile Gly Gly Ser Ile Glu Phe Val Asn Arg Ala Thr Leu Ile Asp
355 360 365

Ser Pro Phe Phe Arg Phe Asn Pro Ser Asn Asn Ser Tyr Tyr Asp Asp
370 375 380

Met Asp Gly Asp Gly Val Leu Cys Met Ala Val Asp Ile Leu Pro Thr
385 390 395 400

Glu Phe Ala Lys Glu Ala Ser Gln His Phe Gly Asp Ile Leu Ser Gly
405 410 415

Phe Val Gly Ser Leu Ala Ser Met Thr Glu Ile Ser Asp Leu Pro Ala
420 425 430

His Leu Lys Arg Ala Cys Ile Ser Tyr Arg Gly Glu Leu Thr Ser Leu
435 440 445

Tyr Glu Tyr Ile Pro Arg Met Arg Lys Ser Asn Pro Glu Glu Ala Gln
450 455 460

Asp Asn Ile Ile Ala Asn Gly Val Ser Ser Gln Arg Thr Phe Asn Ile
465 470 475 480

Leu Val Ser Leu Ser Gly His Leu Phe Asp Lys Phe Leu Ile Asn Glu
485 490 495

Ala Leu Asp Met Ile Glu Ala Ala Gly Gly Ser Phe His Leu Ala Lys
500 505 510

Cys Glu Leu Gly Gln Ser Ala Asp Ala Glu Ser Tyr Ser Glu Leu Glu
515 520 525

Val Gly Ala Asp Asp Lys Arg Val Leu Asp Gln Ile Ile Asp Ser Leu
530 535 540

Thr Arg Leu Ala Asn Pro Asn Glu Asp Tyr Ile Ser Pro His Arg Glu
545 550 555 560

Ala Asn Lys Ile Ser Leu Lys Ile Gly Lys Val Gln Gln Glu Asn Glu
565 570 575

Ile Lys Glu Lys Pro Glu Met Thr Lys Lys Ser Gly Val Leu Ile Leu
580 585 590

Gly Ala Gly Arg Val Cys Arg Pro Ala Ala Asp Phe Leu Ala Ser Val
595 600 605

Arg Thr Ile Ser Ser Gln Gln Trp Tyr Lys Thr Tyr Phe Gly Ala Asp
610 615 620

Ser Glu Glu Lys Thr Asp Val His Val Ile Val Ala Ser Leu Tyr Leu
625 630 635 640

Lys Asp Ala Lys Glu Thr Val Glu Gly Ile Ser Asp Val Glu Ala Val
645 650 655

Arg Leu Asp Val Ser Asp Ser Glu Ser Leu Leu Lys Tyr Val Ser Gln
660 665 670

Val Asp Val Val Leu Ser Leu Leu Pro Ala Ser Cys His Ala Val Val
675 680 685

Ala Lys Thr Cys Ile Glu Leu Lys Lys His Leu Val Thr Ala Ser Tyr
690 695 700

Val Asp Asp Glu Thr Ser Met Leu His Glu Lys Ala Lys Ser Ala Gly
705 710 715 720

Ile Thr Ile Leu Gly Glu Met Gly Leu Asp Pro Gly Ile Asp His Met
725 730 735

Met Ala Met Lys Met Ile Asn Asp Ala His Ile Lys Lys Gly Lys Val
740 745 750

Lys Ser Phe Thr Ser Tyr Cys Gly Gly Leu Pro Ser Pro Ala Ala Ala
755 760 765

Asn Asn Pro Leu Ala Tyr Lys Phe Ser Trp Asn Pro Ala Gly Ala Ile
770 775 780

Arg Ala Gly Gln Asn Pro Ala Lys Tyr Lys Ser Asn Gly Asp Ile Ile
785 790 795 800

His Val Asp Gly Lys Asn Leu Tyr Asp Ser Ala Ala Arg Phe Arg Val
805 810 815

Pro Asn Leu Pro Ala Phe Ala Leu Glu Cys Phe Pro Asn Arg Asp Ser
820 825 830

Leu Val Tyr Gly Glu His Tyr Gly Ile Glu Ser Glu Ala Thr Thr Ile
835 840 845

Phe Arg Gly Thr Leu Arg Tyr Glu Gly Phe Ser Met Ile Met Ala Thr
850 855 860

Leu Ser Lys Leu Gly Phe Phe Asp Ser Glu Ala Asn Gln Val Leu Ser
865 870 875 880

Thr Gly Lys Arg Ile Thr Phe Gly Ala Leu Leu Ser Asn Ile Leu Asn
885 890 895

Lys Asp Ala Asp Asn Glu Ser Glu Pro Leu Ala Gly Glu Glu Glu Ile
900 905 910

Ser Lys Arg Ile Ile Lys Leu Gly His Ser Lys Glu Thr Ala Ala Lys
915 920 925

Ala Ala Lys Thr Ile Val Phe Leu Gly Phe Asn Glu Glu Arg Glu Val
930 935 940

Pro Ser Leu Cys Lys Ser Val Phe Asp Ala Thr Cys Tyr Leu Met Glu
945 950 955 960

Glu Lys Leu Ala Tyr Ser Gly Asn Glu Gln Asp Met Val Leu Leu His
965 970 975

His Glu Val Glu Val Glu Phe Leu Glu Ser Lys Arg Ile Glu Lys His
980 985 990

Thr Ala Thr Leu Leu Glu Phe Gly Asp Ile Lys Asn Gly Gln Thr Thr
995 1000 1005

Thr Ala Met Ala Lys Thr Val Gly Ile Pro Ala Ala Ile Gly Ala Leu
1010 1015 1020

Val Leu Ile Glu Asp Lys Ile Lys Thr Arg Gly Val Leu Arg Pro Leu
1025 1030 1035 1040

Glu Ala Glu Val Tyr Leu Pro Ala Leu Asp Ile Leu Gln Ala Tyr Gly
1045 1050 1055

Ile Lys Leu Met Glu Lys Ala Glu
1060

(2) INFORMATION FOR SEQ ID NO:113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 

TTYTCICAYA CICAYAARGC ICA 23 

(2) INFORMATION FOR SEQ ID NO:114:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: 

TTYTCCCART ACATRCARTT 20 

(2) INFORMATION FOR SEQ ID NO:115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: 

GAAAACATGC CTTTGCTGGA TAAGATTCTA GCTGAGAGGG CATCGTTATA TGACTATGAA 60 

TTAATTGTTG GGGACACTGG GAAAAGGTTA CTTGCATTTG GAAAATTCGC TGGTAGGGCT 120 

GGAATGATCG ACTTTTTGCG CGGATTAGGA CAGCGGTTTT TAAGTCTTGG ATATTCAACA 180 

CCTTTCTTGT CACTTGGATC ATCTTACATG TACCCTTCCC TGGCTGCTGC TAAGGCTGCT 240 

GTGATTTCTG TTGGTGAAAA ATTGCGACGC AGGGATTGCC ATTGGGGATT TGTCCCCTGG 300 

TTTGTTTATT TACTGGTTCA GGAAATGTTT GTTCTGGTGC ACAGGAGATA TTTAAGCTTC 360 

TTCCTCATAC CTTTGTTGAT CCATCTAAAC TACGCGACCT ACATAGAACG GACCCAGATC 420 

AACCAAGGCA TGCTTCAAAA AGAGTTTTCC AAGTTTATGG TTGTGTTGTG ACTGCCCAAG 480 

ACATGGTTGA ACCCAAAGAT CACGTGATAG TGTTTGACAA AGCAGACTAC TATGCACATC 540 
CTGAGCATTA CAATCCCACT TTCCATGAAA AAATAGCACC ATATGCATCT GTTATTGTCA 600

ATTGCATGTA TTGGGAAAA 619

(2) INFORMATION FOR SEQ ID NO:116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:116:

GAGAATATGC CACTGTTAGA CAAGATCCTT GAAGAAAGGG TGTCCTTGTT TGATTATGAG 60

CTAATTGTTG GAGATGATGG GAAAAGATCA CTAGCATTTG GGAAATTTGC TGGTAGAGCT 120

GGACTGATAG ATTTCTTACA TGGTCTCGGA CAGCGATATT TGAGCCTTGG ATACTCCACT 180

CCATTTCTCT CTCTGGGACA TCTCATATGT TCCTTCGCTC GCTGCAGCCA AGGCTGCAGT 240

CATTGTCGTT GCAGAAGAGA TAGCAACATT TGGACTTCCA TCCGGAATTT GTCCGATAGT 300

GTTTGTGTTC ACTGGAGTTG GAAACGTCTC TCAGGGTGCG CAGGAGATAT TCAAGTTATT 360

GCCCCATACC TTTGTTGATG CTGAGAAGCT TCCCGAAATT TTTCAGGCCA GGAATCTGTC 420

TAAGCAATCT CAGTCGACCA AGAGAGTATT TCAACTTTAT GGTTGTGTTG TGACCTCTAG 480

AGACATAGTT TCTCACAAGG ATCCCACCAG ACAATTTGAC AAAGGTGACT ATTATGCTCA 540

TCCAGAACAC TACACCCCTG TTTTTCATGA AAGAATTGCT CCATATGCAT CTGTCATCGT 600

AAACTGCATG TATTGGGAAA 620


(2) INFORMATION FOR SEQ ID NO:117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:117: 


Glu Asn Met Pro Leu Leu Asp Lys Ile Leu Ala Glu Arg Ala Ser Leu
1 5 10 15

Tyr Asp Tyr Glu Leu Ile Val Gly Asp Thr Gly Lys Arg Leu Leu Ala
20 25 30

Phe Gly Lys Phe Ala Gly Arg Ala Gly Met Ile Asp Phe Leu Arg Gly
35 40 45

Leu Gly Gln Arg Phe Leu Ser Leu Gly Tyr Ser Thr Pro Phe Leu Ser
50 55 60

Leu Gly Ser Ser Tyr Met Tyr Pro Ser Leu Ala Ala Ala Lys Ala Ala
65 70 75 80

Val Ile Ser Val Gly Glu Xaa Ile Ala Thr Gln Gly Leu Pro Leu Gly
85 90 95

Ile Cys Pro Leu Val Cys Leu Phe Thr Gly Ser Gly Asn Val Cys Ser
100 105 110

Gly Ala Gln Glu Ile Phe Lys Leu Leu Pro His Thr Phe Val Asp Pro
115 120 125

Ser Lys Leu Arg Asp Leu His Arg Thr Asp Pro Asp Gln Pro Arg His
130 135 140

Ala Ser Lys Arg Val Phe Gln Val Tyr Gly Cys Val Val Thr Ala Gln
145 150 155 160

Asp Met Val Glu Pro Lys Asp His Val Ile Val Phe Asp Lys Ala Asp
165 170 175

Tyr Tyr Ala His Pro Glu His Tyr Asn Pro Thr Phe His Glu Lys Ile
180 185 190

Ala Pro Tyr Ala Ser Val Ile Val Asn Cys Met Tyr Trp Glu
195 200 205

(2) INFORMATION FOR SEQ ID NO:118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 


Glu Asn Met Pro Leu Leu Asp Lys Ile Leu Glu Glu Arg Val Ser Leu
1 5 10 15

Phe Asp Tyr Glu Leu Ile Val Gly Asp Asp Gly Lys Arg Ser Leu Ala
20 25 30

Phe Gly Lys Phe Ala Gly Arg Ala Gly Leu Ile Asp Phe Leu His Gly
35 40 45

Leu Gly Gln Arg Tyr Leu Ser Leu Gly Tyr Ser Thr Pro Phe Leu Ser
50 55 60

Leu Gly Xaa Ser His Met Xaa Pro Ser Leu Ala Ala Ala Lys Ala Ala
65 70 75 80

Val Ile Val Val Ala Glu Glu Ile Ala Thr Phe Gly Leu Pro Ser Gly
85 90 95

Ile Cys Pro Ile Val Phe Val Phe Thr Gly Val Gly Asn Val Ser Gln
100 105 110

Gly Ala Gln Glu Ile Phe Lys Leu Leu Pro His Thr Phe Val Asp Ala
115 120 125

Glu Lys Leu Pro Glu Ile Phe Gln Ala Arg Asn Leu Ser Lys Gln Ser
130 135 140

Gln Ser Thr Lys Arg Val Phe Gln Leu Tyr Gly Cys Val Val Thr Ser
145 150 155 160

Arg Asp Ile Val Ser His Lys Asp Pro Thr Arg Gln Phe Asp Lys Gly
165 170 175

Asp Tyr Tyr Ala His Pro Glu His Tyr Thr Pro Val Phe His Glu Arg
180 185 190

Ile Ala Pro Tyr Ala Ser Val Ile Val Asn Cys Met Tyr Trp Glu
195 200 205

(2) INFORMATION FOR SEQ ID NO:119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..2357 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119: 

TTGAACCCAA AGATCACGTG ATAGTGTTTG ACAAAGCAGA CTACTATTCA CACCCTGAGC 60 
ATTACAATCC CACTTTCCAT GAAAAAATAG CACCATATGC ATCTGTTATT GTCAATTGCA 120 

TGTATTGGGA GAAAAGATTT CCTCAATTGC CGAGCTATAA GCAGATGCAA GACTTAATGG 180 

GCCGGGGGAG CCCCCTTGTT GGAATAGCTG ACATAACGTG TGATATAGGG GGTTCAATTG 240 

AGTTTGTTAA CCGCGGTACT TCAATTGATT CACCCTTCTT CAGATATGAT CCCTTAACAA 300 

ATTCCTACCA TGATGATATG GAGGGGAATG GAGTGATATG CTTAGCTGTT GACATTCTTC 360 

CAACAGAATT TGCAAAGGAG GCTTCCCAAC ATTTTGGAAA CATACTTTCC CAATTTGTTG 420 
TAAATTTGGC TTCTGCTACA GACATTACAA AGTTGCCTGC TCACTTAAGG AGAGCTTGCA 480

TAGCCCATAA AGGAGTGCTA ACCTCCTTAT ATGATTATAT CCCACGCATG CGGAGTTCTG 540 

ATTCAGAGGA AGTATCAGAA AACGCAGAAA ATTCTCTATC CAACAAAAGG AAGTACAATA 600 

TATCGGTGTC TCTGAGTGGT CACTTATTTG ATCAGTTTCT GATAAATGAG GCCTTAGATA 660 
TTATTGAAGC TGCAGGAGGC TCCTTCCACT TAGTCAACTG CCATGTGGGT CAGAGCATTG 720 
AAGCCGTATC ATTCTCTGAA CTTGAAGTTG GTGCAGATAA CAGGGCTGTT CTGGATCAAA 780 
TCATTGATTC TTTAACTGCT ATTGCTAGTC CAACTGAACA TGATAGATTT TCAAATCAAG 840 
ATTCAAGTAA AATTTCACTT AAGCTTGGTA AAGTTGAAGA GAATGGCATA GAGAAGGAAT 900 
CTGACCCCAG AAAGAAGGCT GCGGTTTTAA TTCTTGGAGC TGGTCGGGTC TGTCAACCAG 960 
CTGCTGAAAT GTTATCATCA TTTGGAAGGC CATCATCGAG CCAATGGTAT AAAACATTGT 1020 
TGGAAGATGA TTTTGAATGT CAAACTGATG TAGAAGTCAT TGTGGGATCT CTGTACCTGA 1080 
AGGATGCAGA GCAGACTGTT GAGGGCATTC CAAATGTAAC CGGAATTCAG CTTGATGTGA 1140 
TGGATCGTGC CAATTTGTGT AAGTACATTT CACAGGTTGA CGTTGTTATA AGTTTGCTGC 1200 
CCCCAAGTTG TCATATTATT GTAGCAAATG CTTGCATTGA GCTGAAAAAA CATCTTGTCA 1260 
CTGCTAGCTA TGTTGATAGC TCCATGTCAA TGCTAAATGA TAAGGCTAAA GATGCTGGCA 1320 
TAACAATTCT TGGAGAGATG GGCTTGGACC CAGGAATTGG TCATATGATG GCAATGAAGA 1380 
TGATCAACCA AGCACATGTG AGGAAGGGGA AAATAAAGTC TTTCACTTCT TATTGTGGTG 1440 
GACTTCCATC TCCTGAAGCT GCTAACAATC CATTAGCATA TAAATTCAGT TGGAATCCTG 1500 
CAGGAGCCAT CCGAGCTGGG CGCAATCCTG CCACCTACAA ATGGGGTGGT GAAACTGTAC 1560 
ATATTGATGG GGACGATCTT TATGATTCGG CTACAAGACT AAGGCTACCG GACCTTCCTG 1620 
CTTTTGCTTT GGAATGTCTC CCAAATCGCA ATTCATTACT TTATGGGGAT TTGTATGGAA 1680 
TAACTGAAGC ATCAACCATT TTCCGTGGAA CCCTCCGCTA TGAAGGATTT AGTGAGATCA 1740 
TGGGGACACT GTCTAGGATT AGCTTATTTA ACAATGAAGC CCATTCGTTG CTAATGAATG 1800 
GACAAAGACC AACTTTCAAA AAATTCTTAT TTGAACTTCT CAAAGTTGTT GGTGATAATC 1860 
CAGATGAACT ATTGATAGGA GAGAATGACA TCATGGAGCA AATATTAATA CAAGGGCACT 1920 
GCAAAGATCA AAGAACGGCA ATGGAGACAG CAAAAACAAT CATTTTCTTG GGACTTCTTG 1980 
ACCAAACTGA AATCCCTGCT TCCTGCAAAA GTGCTTTTGA TGTTGCTTGT TTCCGCATGG 2040 
AGGAGAGGTT ATCATACACC AGCACAGAAA AGGATATGGT GCTTTTGCAT CATGAAGTGG 2100 
AAATAGAATA CCCAGATAGC CAAATTACAG AGAAGCATAG AGCTACTTTA CTTGAATTTG 2160 
GGAAGACTCT TGATGAAAAA ACCACAACTG CCATGGCCCT TACTGTTGGT ATTCCAGCTG 2220 
CTGTTGGAGC TTTGCTTTTA TTGACAAACA AAATTCAGAC AAGAGGAGTC TTAAGGCCTA 2280 
TCGAACCTGA AGTATACAAT CCAGCACTGG ATATTATAGA AGCTTATGGG ATCAAGTTGA 2340 
TAGAGAAGAC CGAGTAATTT GCATYTATGA ATTGATGTAT AGGTGTACAT TAATGTACAC 2400 
CATGCAATGT TTGATTTGAA TAAGATAAAA TATAATAATT ACTGCAGTCA TGGAATTGCA 2460
ACTGCCATTC TATGCAACTG TCAGAAATGG ACCACACGGT ACCAGCATAG TTAAAACACT 2520
TAGGCAGATA CCAATTTCAA TTGCAGCAGT ACAATCCAAC CAGTTATGAA GTATGGTTCT 2580
AG 2582

(2) INFORMATION FOR SEQ ID NO:120: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 3265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Zea mays

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..3071

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:

ATTGTGCCCG CCTTCTGCTA GGAGGAGGCA AGAACGGACC TCGAGTAAAC CGGATTATTG 60
TGCAGCCAAG CACAAGGAGG ATCCATCATG ACGCTCAGTA TGAGGATGCA GGATGCGAGA 120
TTTCAGAAGA CCTGTCAGAA TGCGGCCTTA TCATAGGCAT CAAACAACCC AAGCTGCAGA 180
TGATTCTTTC AGATAGAGCG TACGCTTTCT TTTCACACAC ACACAAAGCC CAAAAAGAGA 240
ATATGCCACT GTTAGACAAG ATCCTTGAAG AAAGGGTGTC CTTGTTTGAT TATGAGCTAA 300
TTGTTGGAGA TGATGGGAAA AGATCACTAG CATTTGGGAA ATTTGCTGGT AGAGCTGGAC 360
TGATAGATTT CTTACATGGT CTCGGACAGC GATATTTGAG CCTTGGATAC TCGACTCCAT 420
TTCTCTCTCT GGGACAATCT CATATGTATC CTTCGCTCGC TGCAGCCAAG GCTGCAGTCA 480
TTGTCGTTGC AGAAGAGATA GCAACATTTG GACTTCCATC CGGAATTTGT CCGATAGTGT 540
TTGTGTTCAC TGGAGTTGGA AACGTCTCTC AGGGTGCGCA GGAGATATTC AAGTTATTGC 600
CCCATACCTT TGTTGATGCT GAGAAGCTTC CCGAAATTTT TCAGGCCAGG AATCTGTCTA 660
AGCAATCTCA GTCGACCAAG AGAGTATTTC AACTTTATGG TTGTGTTGTG ACCTCTAGAG 720
ACATAGTTTC TCACAAGGAT CCCACCAGAC AATTTGACAA AGGTGACTAT TATGCTCATC 780
CAGAACACTA CACCCCTGTT TTTCATGAAA GAATTGCTCC ATATGCATCT GTCATCGTAA 840
ACTGTATGTA TTGGGAGAAG AGGTTTCCAC CATTACTAAA TATGGATCAG TTACAGCAAT 900
TGATGGAGAC TGGTTGTCCT TTAGTCGGCG TTTGTGACAT AACTTGTGAT ATTGGAGGTT 960
CCATTGAATT TATCAACAAG AGTACATCAA TAGAGAGGCC TTTCTTTCGG TATGATCCTT 1020
CTAAGAATTC ATACCATGAT GATATGGAAG GTGCCGGAGT GGTCTGCTTG GCTGTTGACA 1080
TTCTCCCTAC AGAATTCTCT AAAGAGGCCT CCCAACATTT TGGAAACATA CTATCTAGAC 1140
TTGTTGCTAG TTTGGCCTCA GTGAAGCAAC CGGCAGAACT TCCTTCCTAC TTGAGAAGAG 1200
CTTGCATTGC ACATGCTGGC AGATTAACTC CTTTGTATGA ATATATCCCT AGGATGAGAA 1260
ATACTATGAT AGATTTGGCA CCCGCAAAAA CAAATCCATT GCCTGACAAG AAGTATAGCA 1320
CCCTGGTATC TCTCAGTGGG CACCTATTTG ATAAGTTCCT TATAAATGAA GCTTTGGACA 1380
TCATTGAGAC AGCTGGAGGT TCATTTCACT TGGTTAGATG TGAAGTTGGA CAAAGCACGG 1440
ATGATATGTC ATACTCAGAG CTTGAAGTAG GAGCAGATGA TACTGCCACA TTGGATAAAA 1500
TTATTGATTC CTTGACTTCT TTAGCTAATG AACATGGTGG AGATCACGAT GCCGGGCAAG 1560
AAATTGAATT AGCTCTGAAG ATAGGAAAAG TCAATGAGTA TGAAACTGAC GTCACAATTG 1620
ATAAAGGAGG GCCAAAGATT TTAATTCTTG GAGCTGGAAG AGTCTGTCGG CCAGCTGCTG 1680
AGTTTCTGGC ATCTTACCCA GACATATGTA CCTATGGTGT TGATGACCAT GATGCAGATC 1740
AAATTCATGT TATCGTGGCA TCTTTGTATC AAAAAGATGC AGAAGAGACA GTTGATGGTA 1800
TTGAAAATAC AACTGCTACC CAGCTTGATG TTGCTGATAT TGGAAGCCTT TCAGATCTTG 1860
TTTCTCAGGT TGAGGTTGTA ATTAGCTTGC TGCCTGCTAG TTTTCATGCT GCCATTGCAG 1920
GAGTATGCAT AGAGTTGAAG AAGCACATGG TAACGGCAAG CTATGTTGAT GAATCCATGT 1980
CAAACTTGAG CCAAGCTGCC AAAGATGCAG GTGTAACTAT ACTTTGTGAA ATGGGCCTAG 2040
ATCCTGGCAT AGATCACTTG ATGTCAATGA AGATGATTGA TGAAGCTCAT GCACGAAAGG 2100
GAAAAATAAA GGCATTTACA TCTTACTGTG GTGGATTGCC ATCTCCAGCT GCAGCAAACA 2160
ATCCGCTTGC CTATAAATTC AGTTGGAACC CAGCTGGTGC ACTCCGGTCA GGGAAAAATC 2220
CTGCAGTCTA CAAATTTCTT GGTGAGACGA TCCATGTAGA TGGTCATAAC TTGTATGAAT 2280
CAGCAAAGAG GCTCAGACTA CGAGAGCTTC CAGCTTTTGC TCTGGAACAC TTGCCAAATC 2340
GGAATTCCTT GATATATGGT GACCTTTATG GTATCTCCAA AGAAGCATCC ACCATATATA 2400
GGGCTACTYT TCGTTACGAA GGTTTTAGTG AGATTATGGT AACCCTTTCC AAAACTGGGT 2460
TCTTTGATGC TGCAAATCAT CCACTGCTGC AAGATACTAG TCGTCCAACA TATAAGGGTT 2520
TCCTTGATGA ACTACTGAAT AATATCTCCA CAATTAACAC GGACTTAGAT ATTGAAGCTT 2580
CTGGTGGATA CGATGATGAC CTGATTGCCA GACTGTTGAA GCTCGGGTGT TGCAAAAATA 2640
AGGAAATAGC TGTTAAGACA GTCAAAACCA TCAAGTTCTT GGGACTACAT GAAGAGACTC 2700
AAATACCTAA GGGTTGTTCG AGCCCATTTG ATGTGATTTG CCAGCGAATG GAACAGAGGA 2760
TGGCCTATGG CCACAATGAG CAAGACATGG TACTGCTCCA CCACGAAGTC GAGGTGGAAT 2820
ACCCGGACGG GCAACCCGCC GAAAAGCACC AAGCGACGCT ACTGGAGTTC GGGAAGGTTG 2880
AAAATGGCAG GTCCACCACT GCCATGGCGC TGACCGTCGG CATTCCAGCA GCAATAGGGG 2940
CCCTGCTATT GCTAAAGAAT AAGGTCCAGA CGAAAGGAGT GATCAGGCCT CTGCAACCGG 3000
AAATCTACGT TCCAGCATTG GAGATCTTGG AGTCGTCGGG CATCAAGCTG GTTGAGAAAG 3060
TGGAGACTTG AAAGTTCCCT GATACACAGA TAAAGATAGT ATGATATAGC AGGGCACATG 3120
TATCTTTTGT ATTAACTCCG TTCTGGAATA TATATTTGTG AACTAAAATG TGACAAATAA 3180
AAAGAACGGG TGGAGTATAT TGTAAGAGAC GGCAAAGAAA CCTCTGTATA TATGACCTGT 3240
CGATATCAAA TAATGCCGAT CAGTT 3265

(2) INFORMATION FOR SEQ ID NO:121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 

Glu Pro Lys Asp His Val Ile Val Phe Asp Lys Ala Asp Tyr Tyr Ser
1 5 10 15

His Pro Glu His Tyr Asn Pro Thr Phe His Glu Lys Ile Ala Pro Tyr
20 25 30

Ala Ser Val Ile Val Asn Cys Met Tyr Trp Glu Lys Arg Phe Pro Gln
35 40 45

Leu Pro Ser Tyr Lys Gln Met Gln Asp Leu Met Gly Arg Gly Ser Pro
50 55 60


Leu Val Gly Ile Ala Asp Ile Thr Cys Asp Ile Gly Gly Ser Ile Glu
65 70 75 80

Phe Val Asn Arg Gly Thr Ser Ile Asp Ser Pro Phe Phe Arg Tyr Aso
85 90 95

Pro Leu Thr Asn Ser Tyr His Asp Asp Met Glu Gly Asn Gly Val Ile
100 105 110

Cys Leu Ala Val Asp Ile Leu Pro Thr Glu Phe Ala Lys Glu Ala Ser
115 120 125

Gln His Phe Gly Asn Ile Leu Ser Gln Phe Val Val Asn Leu Ala Ser
130 135 140

Ala Thr Asp Ile Thr Lys Leu Pro Ala His Leu Arg Arg Ala Cys Ile
145 150 155 160

Ala His Lys Gly Val Leu Thr Ser Leu Tyr Asp Tyr Ile Pro Arg Met
165 170 175

Arg Ser Ser Asp Ser Glu Glu Val Ser Glu Asn Ala Glu Asn Ser Leu
180 185 190

Ser Asn Lys Arg Lys Tyr Asn Ile Ser Val Ser Leu Ser Gly His Leu
195 200 205

Phe o?n Gln Phe Leu Ile Asn Glu Ala Leu Asp Ile Ile Glu Ala Ala
210 215 220

Gly Gly Ser Phe His Leu Val Asn Cys His Val Gly Gln Ser Ile Glu
225 230 235 240

Ala Val Ser Phe Ser Glu Leu Glu Val Gly Ala Asp Asn Arg Ala Val
245 250 255

Leu Asp Gln Ile Ile Asp Ser Leu Thr Ala Ile Ala Ser Pro Thr Glu
260 265 270

His Asp Arg Phe Ser Asn Gln Asp Ser Ser Lys Ile Ser Leu Lys Leu
275 280 285

Gly Lys Val Glu Glu Asn Gly Ile Glu Lys Glu Ser Asp Pro Arg Lys
290 295 300

Lys Ala Ala Val Leu Ile Leu Gly Ala Gly Arg Val Cys Gln Pro Ala
305 310 315 320

Ala Glu Met Leu Ser Ser Phe Gly Arg Pro Ser Ser Ser Gln Trp Tyr
325 330 335

Lys Thr Leu Leu Glu Asp Asp Phe Glu Cys Gln Thr Asp Val Glu Val
340 345 350

Ile Val Gly Ser Leu Tyr Leu Lys Asp Ala Glu Gln Thr Val Glu Gly
355 360 365

Ile Pro Asn Val Thr Gly Ile Gln Leu Asp Val Met Asp Arg Ala Asn
370 375 380

Leu Cys Lys Tyr Ile Ser Gln Val Asp Val Val Ile Ser Leu Leu Pro
385 390 395 400

Pro Ser Cys His Ile Ile Val Ala Asn Ala Cys Ile Glu Leu Lys Lys
405 410 415

His Leu Val Thr Ala Ser Tyr Val Asp Ser Ser Met Ser Met Leu Asn
420 425 430

Asp Lys Ala Lys Asp Ala Gly Ile Thr Ile Leu Gly Glu Met Gly Leu
435 440 445

Asp Pro Gly Ile Gly His Met Met Ala Met Lys Met Ile Asn Gln Ala
450 455 460

His Val Arg Lys Gly Lys Ile Lys Ser Phe Thr Ser Tyr Cys Gly Gly
465 470 475 480

Leu Pro Ser Pro Glu Ala Ala Asn Asn Pro Leu Ala Tyr Lys Phe Ser
485 490 495

Trp Asn Pro Ala Gly Ala Ile Arg Ala Gly Arg Asn Pro Ala Thr Tyr
500 505 510

Lys Trp Gly Gly Glu Thr Val His Ile Asp Gly Asp Asp Leu Tyr Asp
515 520 525

Ser Ala Thr Arg Leu Arg Leu Pro Asp Leu Pro Ala Phe Ala Leu Glu
530 535 540

Cys Leu Pro Asn Arg Asn Ser Leu Leu Tyr Gly Asp Leu Tyr Gly Ile
545 550 555 560

Thr Glu Ala Ser Thr Ile Phe Arg Gly Thr Leu Arg Tyr Glu Gly Phe
565 570 575

Ser Glu Ile Met Gly Thr Leu Ser Arg Ile Ser Leu Phe Asn Asn Glu
580 585 590

Ala His Ser Leu Leu Met Asn Gly Gln Arg Pro Thr Phe Lys Lys Phe
595 600 605

Leu Phe Glu Leu Leu Lys Val Val Gly Asp Asn Pro Asp Glu Leu Leu
610 615 620

Ile Gly Glu Asn Asp Ile Met Glu Gln Ile Leu Ile Gln Gly His Cys
625 630 635 640

Lys Asp Gln Arg Thr Ala Met Glu Thr Ala Lys Thr Ile Ile Phe Leu
645 650 655

Gly Leu Leu Asp Gln Thr Glu Ile Pro Ala Ser Cys Lys Ser Ala Phe
660 665 670

Asp Val Ala Cys Phe Arg Met Glu Glu Arg Leu Ser Tyr Thr Ser Thr
675 680 685

Glu Lys Asp Met Val Leu Leu His His Glu Val Glu Ile Glu Tyr Pro
690 695 700

Asp Ser Gln Ile Thr Glu Lys His Arg Ala Thr Leu Leu Glu Phe Gly
705 710 715 720

Lys Thr Leu Asp Glu Lys Thr Thr Thr Ala Met Ala Leu Thr Val Gly
725 730 735

Ile Pro Ala Ala Val Gly Ala Leu Leu Leu Leu Thr Asn Lys Ile Gln
740 745 750

Thr Arg Gly Val Leu Arg Pro Ile Glu Pro Glu Val Tyr Asn Pro Ala
755 760 765

Leu Asp Ile Ile Glu Ala Tyr Gly Ile Lys Leu Ile Glu Lys Thr Glu
770 775 780


(2) INFORMATION FOR SEQ ID NO:122: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122:


Cys Ala Arg Leu Leu Leu Gly Gly Gly Lys Asn Gly Pro Arg Val Asn
1 5 10 15

Arg Ile Ile Val Gln Pro Ser Thr Arg Arg Ile His His Asp Ala Gln
20 25 30

Tyr Glu Asp Ala Gly Cys Glu Ile Ser Glu Asp Leu Ser Glu Cys Gly
35 40 45

Leu Ile Ile Gly Ile Lys Gln Pro Lys Leu Gln Met Ile Leu Ser Asp
50 55 60

Arg Ala Tyr Ala Phe Phe Ser His Thr His Lys Ala Gln Lys Glu Asn
65 70 75 80

Met Pro Leu Leu Asp Lys Ile Leu Glu Glu Arg Val Ser Leu Phe Asp
85 90 95

Tyr Glu Leu Ile Val Gly Asp Asp Gly Lys Arg Ser Leu Ala Phe Gly
100 105 110

Lys Phe Ala Gly Arg Ala Gly Leu Ile Asp Phe Leu His Gly Leu Gly
115 120 125

Gln Arg Tyr Leu Ser Leu Gly Tyr Ser Thr Pro Phe Leu Ser Leu Gly
130 135 140

Gln Ser His Met Tyr Pro Ser Leu Ala Ala Ala Lys Ala Ala Val Ile
145 150 155 160

Val Val Ala Glu Glu Ile Ala Thr Phe Gly Leu Pro Ser Gly Ile Cys
165 170 175

Pro Ile Val Phe Val Phe Thr Gly Val Gly Asn Val Ser Gln Gly Ala
180 185 190

Gln Glu Ile Phe Lys Leu Leu Pro His Thr Phe Val Asp Ala Glu Lys
195 200 205

Leu Pro Glu Ile Phe Gln Ala Arg Asn Leu Ser Lys Gln Ser Gln Ser
210 215 220

Thr Lys Arg Val Phe Gln Leu Tyr Gly Cys Val Val Thr Ser Arg Asp
225 230 235 240

Ile Val Ser His Lys Asp Pro Thr Arg Gln Phe Asp Lys Gly Asp Tyr
245 250 255

Tyr Ala His Pro Glu His Tyr Thr Pro Val Phe His Glu Arg Ile Ala
260 265 270

Pro Tyr Ala Ser Val Ile Val Asn Cys Met Tyr Trp Glu Lys Arg Phe
275 280 285

Pro Pro Leu Leu Asn Met Asp Gln Leu Gln Gln Leu Met Glu Thr Gly
290 295 300

Cys Pro Leu Val Gly Val Cys Asp Ile Thr Cys Asp Ile Gly Gly Ser
305 310 315 320

Ile Glu Phe Ile Asn Lys Ser Thr Ser Ile Glu Arg Pro Phe Phe Arg
325 330 335

Tyr Asp Pro Ser Lys Asn Ser Tyr His Asp Asp Met Glu Gly Ala Gly
340 345 350

Val Val Cys Leu Ala Val Asp Ile Leu Pro Thr Glu Phe Ser Lys Glu
355 360 365

Ala Ser Gln His Phe Gly Asn Ile Leu Ser Arg Leu Val Ala Ser Leu
370 375 380

Ala Ser Val Lys Gln Pro Ala Glu Leu Pro Ser Tyr Leu Arg Arg Ala
385 390 395 400

Cys Ile Ala His Ala Gly Arg Leu Thr Pro Leu Tyr Glu Tyr Ile Pro
405 410 415

Arg Met Arg Asn Thr Met Ile Asp Leu Ala Pro Ala Lys Thr Asn Pro
420 425 430

Leu Pro Asp Lys Lys Tyr Ser Thr Leu Val Ser Leu Ser Gly His Leu
435 440 445

Phe Asp Lys Phe Leu Ile Asn Glu Ala Leu Asp Ile Ile Glu Thr Ala
450 455 460

Gly Gly Ser Phe His Leu Val Arg Cys Glu Val Gly Gln Ser Thr Asp
465 470 475 480

Asp Met Ser Tyr Ser Glu Leu Glu Val Gly Ala Asp Asp Thr Ala Thr
485 490 495

Leu Asp Lys Ile Ile Asp Ser Leu Thr Ser Leu Ala Asn Glu His Gly
500 505 510

Gly Asp His Asp Ala Gly Gln Glu Ile Glu Leu Ala Leu Lys Ile Gly
515 520 525

Lys Val Asn Glu Tyr Glu Thr Asp Val Thr Ile Asp Lys Gly Gly Pro
530 535 540

Lys Ile Leu Ile Leu Gly Ala Gly Arg Val Cys Arg Pro Ala Ala Glu
545 550 555 560

Phe Leu Ala Ser Tyr Pro Asp Ile Cys Thr Tyr Gly Val Asp Asp His
565 570 575

Asp Ala Asp Gln Ile His Val Ile Val Ala Ser Leu Tyr Gln Lys Asp
580 585 590

Ala Glu Glu Thr Val Asp Gly Ile Glu Asn Thr Thr Ala Thr Gln Leu
595 600 605

Asp Val Ala Asp Ile Gly Ser Leu Ser Asp Leu Val Ser Gln Val Glu
610 615 620

Val Val Ile Ser Leu Leu Pro Ala Ser Phe His Ala Ala Ile Ala Gly
625 630 635 640

Val Cys Ile Glu Leu Lys Lys His Met Val Thr Ala Ser Tyr Val Asp
645 650 655

Glu Ser Met Ser Asn Leu Ser Gln Ala Ala Lys Asp Ala Gly Val Thr
660 665 670

Ile Leu Cys Glu Met Gly Leu Asp Pro Gly Ile Asp His Leu Met Ser
675 680 685

Met Lys Met Ile Asp Glu Ala His Ala Arg Lys Gly Lys Ile Lys Ala
690 695 700

Phe Thr Ser Tyr Cys Gly Gly Leu Pro Ser Pro Ala Ala Ala Asn Asn
705 710 715 720

Pro Leu Ala Tyr Lys Phe Ser Trp Asn Pro Ala Gly Ala Leu Arg Ser
725 730 735

Gly Lys Asn Pro Ala Val Tyr Lys Phe Leu Gly Glu Thr Ile His Val
740 745 750

Asp Gly His Asn Leu Tyr Glu Ser Ala Lys Arg Leu Arg Leu Arg Glu
755 760 765

Leu Pro Ala Phe Ala Leu Glu His Leu Pro Asn Arg Asn Ser Leu Ile
770 775 780

Tyr Gly Asp Leu Tyr Gly Ile Ser Lys Glu Ala Ser Thr Ile Tyr Arg
785 790 795 800

Ala Thr Xaa Arg Tyr Glu Gly Phe Ser Glu Ile Met Val Thr Leu Ser
805 810 815

Lys Thr Gly Phe Phe Asp Ala Ala Asn His Pro Leu Leu Gln Asp Thr
820 825 830

Ser Arg Pro Thr Tyr Lys Gly Phe Leu Asp Glu Leu Leu Asn Asn Ile
835 840 845

Ser Thr Ile Asn Thr Asp Leu Asp Ile Glu Ala Ser Gly Gly Tyr Asp
850 855 860

Asp Asp Leu Ile Ala Arg Leu Leu Lys Leu Gly Cys Cys Lys Asn Lys
865 870 875 880

Glu Ile Ala Val Lys Thr Val Lys Thr Ile Lys Phe Leu Gly Leu His
885 890 895

Glu Glu Thr Gln Ile Pro Lys Gly Cys Ser Ser Pro Phe Asp Val Ile
900 905 910

Cys Gln Arg Met Glu Gln Arg Met Ala Tyr Gly His Asn Glu Gln Asp
915 920 925

Met Val Leu Leu His His Glu Val Glu Val Glu Tyr Pro Asp Gly Gln
930 935 940

Pro Ala Glu Lys His Gln Ala Thr Leu Leu Glu Phe Gly Lys Val Glu
945 950 955 960

Asn Gly Arg Ser Thr Thr Ala Met Ala Leu Thr Val Gly Ile Pro Ala
965 970 975

Ala Ile Gly Ala Leu Leu Leu Leu Lys Asn Lys Val Gln Thr Lys Gly
980 985 990

Val Ile Arg Pro Leu Gln Pro Glu Ile Tyr Val Pro Ala Leu Glu Ile
995 1000 1005

Leu Glu Ser Ser Gly Ile Lys Leu Val Glu Lys Val Glu Thr
1010 1015 1020


(2) INFORMATION FOR SEQ ID NO:123:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1908 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..1908 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

ATTGTGCCCG CCTTCTGCTA GGAGGAGGCA AGAACGGACC TCGAGTAAAC CGGATTATTG 60
TGCAGCCAAG CACAAGGAGG ATCCATCATG ACGCTCAGTA TGAGGATGCA GGATGCGAGA 120
TTTCAGAAGA CCTGTCAGAA TGCGGCCTTA TCATAGGCAT CAAACAACCC AAGCTGCAGA 180
TGATTCTTTC AGATAGAGCG TACGCTTTCT TTTCACACAC ACACAAAGCC CAAAAAGAGA 240
ATATGCCACT GTTAGACAAG ATCCTTGAAG AAAGGGTGTC CTTGTTTGAT TATGAGCTAA 300
TTGTTGGAGA TGATGGGAAA AGATCACTAG CATTTGGGAA ATTTGCTGGT AGAGCTGGAC 360
TGATAGATTT CTTACATGGT CTCGGACAGC GATATTTGAG CCTTGGATAC TCGACTCCAT 420
TTCTCTCTCT GGGACAATCT CATATGTATC CTTCGCTCGC TGCAGCCAAG GCTGCAGTCA 480
TTGTCGTTGC AGAAGAGATA GCAACATTTG GACTTCCATC CGGAATTTGT CCGATAGTGT 540
TTGTGTTCAC TGGAGTTGGA AACGTCTCTC AGGGTGCGCA GGAGATATTC AAGTTATTGC 600
CCCATACCTT TGTTGATGCT GAGAAGCTTC CCGAAATTTT TCAGGCCAGG AATCTGTCTA 660
AGCAATCTCA GTCGACCAAG AGAGTATTTC AACTTTATGG TTGTGTTGTG ACCTCTAGAG 720
ACATAGTTTC TCACAAGGAT CCCACCAGAC AATTTGACAA AGGTGACTAT TATGCTCATC 780
CAGAACACTA CACCCCTGTT TTTCATGAAA GAATTGCTCC ATATGCATCT GTCATCGTAA 840
ACTGTATGTA TTGGGAGAAG AGGTTTCCAC CATTACTAAA TATGGATCAG TTACAGCAAT 900
TGATGGAGAC TGGTTGTCCT TTAGTCGGCG TTTGTGACAT AACTTGTGAT ATTGGAGGTT 960
CCATTGAATT TATCAACAAG AGTACATCAA TAGAGAGGCC TTTCTTTCGG TATGATCCTT 1020
CTAAGAATTC ATACCATGAT GATATGGAAG GTGCCGGAGT GGTCTGCTTG GCTGTTGACA 1080
TTCTCCCTAC AGAATTCTCT AAAGAGGCCT CCCAACATTT TGGAAACATA CTATCTAGAC 1140
TTGTTGCTAG TTTGGCCTCA GTGAAGCAAC CGGCAGAACT TCCTTCCTAC TTGAGAAGAG 1200
CTTGCATTGC ACATGCTGGC AGATTAACTC CTTTGTATGA ATATATCCCT AGGATGAGAA 1260
ATACTATGAT AGATTTGGCA CCCGCAAAAA CAAATCCATT GCCTGACAAG AAGTATAGCA 1320
CCCTGGTATC TCTCAGTGGG CACCTATTTG ATAAGTTCCT TATAAATGAA GCTTTGGACA 1380
TCATTGAGAC AGCTGGAGGT TCATTTCACT TGGTTAGATG TGAAGTTGGA CAAAGCACGG 1440
ATGATATGTC ATACTCAGAG CTTGAAGTAG GAGCAGATGA TACTGCCACA TTGGATAAAA 1500
TTATTGATTC CTTGACTTCT TTAGCTAATG AACATGGTGG AGATCACGAT GCCGGGCAAG 1560
AAATTGAATT AGCTCTGAAG ATAGGAAAAG TCAATGAGTA TGAAACTGAC GTCACAATTG 1620
ATAAAGGAGG GCCAAAGATT TTAATTCTTG GAGCTGGAAG AGTCTGTCGG CCAGCTGCTG 1680
AGTTTCTGGC ATCTTACCCA GACATATGTA CCTATGGTGT TGATGACCAT GATGCAGATC 1740
AAATTCATGT TATCGTGGCA TCTTTGTATC AAAAAGATGC AGAAGAGACA GTTGATGGTA 1800
TTGAAAATAC AACTGCTACC CAGCTTGATG TTGCTGATAT TGGAAGCCTT TCAGATCTTG 1860
TTTCTCAGGT TGAGGTTGTA ATTAGCTTGC TGCCTGCTAG TTTTCATG 1908

(2) INFORMATION FOR SEQ ID NO:124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 

Cys Ala Arg Leu Leu Leu Gly Gly Gly Lys Asn Gly Pro Arg Val Asn
1 5 10 15

Arg Ile Ile Val Gln Pro Ser Thr Arg Arg Ile His His Asp Ala Gln
20 25 30

Tyr Glu Asp Ala Gly Cys Glu Ile Ser Glu Asp Leu Ser Glu Cys Gly
35 40 45

Leu Ile Ile Gly Ile Lys Gln Pro Lys Leu Gln Met Ile Leu Ser Asp
50 55 60

Arg Ala Tyr Ala Phe Phe Ser His Thr His Lys Ala Gln Lys Glu Asn
65 70 75 80

Met Pro Leu Leu Asp Lys Ile Leu Glu Glu Arg Val Ser Leu Phe Asp
85 90 95

Tyr Glu Leu Ile Val Gly Asp Asp Gly Lys Arg Ser Leu Ala Phe Gly
100 105 110

Lys Phe Ala Gly Arg Ala Gly Leu Ile Asp Phe Leu His Gly Leu Gly
115 120 125

Gln Arg Tyr Leu Ser Leu Gly Tyr Ser Thr Pro Phe Leu Ser Leu Gly
130 135 140

Gln Ser His Met Tyr Pro Ser Leu Ala Ala Ala Lys Ala Ala Val Ile
145 150 155 160

Val Val Ala Glu Glu Ile Ala Thr Phe Gly Leu Pro Ser Gly Ile Cys
165 170 175

Pro Ile Val Phe Val Phe Thr Gly Val Gly Asn Val Ser Gln Gly Ala
180 185 190

Gln Glu Ile Phe Lys Leu Leu Pro His Thr Phe Val Asp Ala Glu Lys
195 200 205

Leu Pro Glu Ile Phe Gln Ala Arg Asn Leu Ser Lys Gln Ser Gln Ser
210 215 220

Thr Lys Arg Val Phe Gln Leu Tyr Gly Cys Val Val Thr Ser Arg Asp
225 230 235 240

Ile Val Ser His Lys Asp Pro Thr Arg Gln Phe Asp Lys Gly Asp Tyr
245 250 255

Tyr Ala His Pro Glu His Tyr Thr Pro Val Phe His Glu Arg Ile Ala
260 265 270

Pro Tyr Ala Ser Val Ile Val Asn Cys Met Tyr Trp Glu Lys Arg Phe
275 280 285

Pro Pro Leu Leu Asn Met Asp Gln Leu Gln Gln Leu Met Glu Thr Gly
290 295 300

Cys Pro Leu Val Gly Val Cys Asp Ile Thr Cys Asp Ile Gly Gly Ser
305 310 315 320

Ile Glu Phe Ile Asn Lys Ser Thr Ser Ile Glu Arg Pro Phe Phe Arg
325 330 335

Tyr Asp Pro Ser Lys Asn Ser Tyr His Asp Asp Met Glu Gly Ala Gly
340 345 350

Val Val Cys Leu Ala Val Asp Ile Leu Pro Thr Glu Phe Ser Lys Glu
355 360 365

Ala Ser Gln His Phe Gly Asn Ile Leu Ser Arg Leu Val Ala Ser Leu
370 375 380

Ala Ser Val Lys Gln Pro Ala Glu Leu Pro Ser Tyr Leu Arg Arg Ala
385 390 395 400

Cys Ile Ala His Ala Gly Arg Leu Thr Pro Leu Tyr Glu Tyr Ile Pro
405 410 415

Arg Met Arg Asn Thr Met Ile Asp Leu Ala Pro Ala Lys Thr Asn Pro
420 425 430

Leu Pro Asp Lys Lys Tyr Ser Thr Leu Val Ser Leu Ser Gly His Leu
435 440 445

Phe Asp Lys Phe Leu Ile Asn Glu Ala Leu Asp Ile Ile Glu Thr Ala
450 455 460

Gly Gly Ser Phe His Leu Val Arg Cys Glu Val Gly Gln Ser Thr Asp
465 470 475 480

Asp Met Ser Tyr Ser Glu Leu Glu Val Gly Ala Asp Asp Thr Ala Thr
485 490 495

Leu Asp Lys Ile Ile Asp Ser Leu Thr Ser Leu Ala Asn Glu His Gly
500 505 510

Gly Asp His Asp Ala Gly Gln Glu Ile Glu Leu Ala Leu Lys Ile Gly
515 520 525

Lys Val Asn Glu Tyr Glu Thr Asp Val Thr Ile Asp Lys Gly Gly Pro
530 535 540


Lys Ile Leu Ile Leu Gly Ala Gly Arg Val Cys Arg Pro Ala Ala Glu
545 550 555 560

Phe Leu Ala Ser Tyr Pro Asp Ile Cys Thr Tyr Gly Val Asp Asp His
565 570 575

Asp Ala Asp Gln Ile His Val Ile Val Ala Ser Leu Tyr Gln Lys Asp
580 585 590

Ala Glu Glu Thr Val Asp Gly Ile Glu Asn Thr Thr Ala Thr Gln Leu
595 600 605

Asp Val Ala Asp Ile Gly Ser Leu Ser Asp Leu Val Ser Gln Val Glu
610 615 620

Val Val Ile Ser Leu Leu Pro Ala Ser Phe His Ala Ala Ile Ala Gly
625 630 635 640


(2) INFORMATION FOR SEQ ID NO:125: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..720 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 215 

(D) OTHER INFORMATION: /label= unknown 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 678 

(D) OTHER INFORMATION: /label= unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125:

GTTTAAACAT CTTTCCAATC TTGTTTCTCA GGTTGAAGTA GTAGTTAGCT TGCTGCCTGC 60 
CAGTTTTCAT GCTGCCATAG CAAGAGTATG CATAGAGATG AAGAAGCACT TGGTCACTGC 120 
AAGCTATGTT GATGAGTCCA TGTCAAAGTT GGAACAATCT GCAGAAGGTG CTGGTGTAAC 180 
TATTCTCTGT GAAATGGGCC TGGATCCTGG CATANATCAT ATGATGTCAA TGAAGATGAT 240 
TGACGAAGCA CATTCACGGA AGGGGAAAAT AAAGTCATTT ACATCCTTTT GTGGAGGACT 300 
TCCATCTCCA GCTTCTGCAA ACAATCCACT TGCTTATAAG TTCAGTTGGA GTCCAGCTGG 360 
TGCCATCCGT GCAGGGAGAA ACCCTGCTGT CTACAAATTT CATGGAGAAA TCATCCATGT 420 
AGATGGTGAT AAATTGTATG AATCCGCAAA GAGGCTCAGA TTACMAGAAC TTCCAGCTTT 480 
TGCACTGGAA CACTTGCCAA ACCGGAATTC CTTGATGTAT GGAGACCTGT ATGGGATCTC 540 
CAAAGAAGCA TCTACTGTGT ACAGGGCTAC TCTTCGTTAT GAAGGATTTA ATGAGATAAT 600
GGCAACCTTC GCGAAAATTG GGTTTTTTGA TGCTGCAAGT CATCCACTGT TGCAACAAAC 660 

TACTCGCCCT ACATACANGG ATTTCCTGTT GAACCCTCAA TGCTTGTACA TCTCCAAAAC 720 

(2) INFORMATION FOR SEQ ID NO:126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 


(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

Phe Lys His Leu Ser Asn Leu Val Ser Gln Val Glu Val Val Val Ser
1 5 10 15

Leu Leu Pro Ala Ser Phe His Ala Ala Ile Ala Arg Val Cys Ile Glu
20 25 30

Met Lys Lys His Leu Val Thr Ala Ser Tyr Val Asp Glu Ser Met Ser
35 40 45

Lys Leu Glu Gln Ser Ala Glu Gly Ala Gly Val Thr Ile Leu Cys Glu
50 55 60

Met Gly Leu Asp Pro Gly Ile Xaa His Met Met Ser Met Lys Met Ile
65 70 75 80

Asp Glu Ala His Ser Arg Lys Gly Lys Ile Lys Ser Phe Thr Ser Phe
85 90 95

Cys Gly Gly Leu Pro Ser Pro Ala Ser Ala Asn Asn Pro Leu Ala Tyr
100 105 110

Lys Phe Ser Trp Ser Pro Ala Gly Ala Ile Arg Ala Gly Arg Asn Pro
115 120 125

Ala Val Tyr Lys Phe His Gly Glu Ile Ile His Val Asp Gly Asp Lys
130 135 140

Leu Tyr Glu Ser Ala Lys Arg Leu Arg Leu Xaa Glu Leu Pro Ala Phe
145 150 155 160

Ala Leu Glu His Leu Pro Asn Arg Asn Ser Leu Met Tyr Gly Asp Leu
165 170 175

Tyr Gly Ile Ser Lys Glu Ala Ser Thr Val Tyr Arg Ala Thr Leu Arg
180 185 190

Tyr Glu Gly Phe Asn Glu Ile Met Ala Thr Phe Ala Lys Ile Gly Phe
195 200 205

Phe Asp Ala Ala Ser His Pro Leu Leu Gln Gln Thr Thr Arg Pro Thr
210 215 220

Tyr Xaa Asp Phe Leu Leu Asn Pro Gln Cys Leu Tyr Ile Ser Lys
225 230 235

(2) INFORMATION FOR SEQ ID NO:127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

CTGCTGTTGC TCCAGAACAA GATCCAAAAG AAAGGAGTGA TCAGGCCTCT GGAACCTGAA 60
ATTTACATTC CAGCGTTGGA GATCTTGGAG TCATCGGGTA TCAAGCTGGC GGAGAGAGTG 120
GAGACCTGAG AATCGGACCC AATATGTATA ATGTAGCATG GTGGTAGCTT CTCTATATAT 180
ATGCTTCAGT GAATAATTGA TTTGCCGTTG TGTGGTAATT AAGCAATGCC CGCTAATAAA 240
TTGTACCGTA GAAGTCCTTC TATGTACATC CGTATCAAAA AATAAAAAAA GCATCGATTA 300
GCTTGAAT 308


(2) INFORMATION FOR SEQ ID NO:128:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

Leu Leu Leu Leu Gln Asn Lys Ile Gln Lys Lys Gly Val Ile Arg Pro
1 5 10 15

Leu Glu Pro Glu Ile Tyr Ile Pro Ala Leu Glu Ile Leu Glu Ser Ser
20 25 30

Gly Ile Lys Leu Ala Glu Arg Val Glu Thr
35 40

(2) INFORMATION FOR SEQ ID NO:129: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 


(vi) ORIGINAL SOURCE:

(A) ORGANISM: Triticum aestivum

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..252


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 172

(D) OTHER INFORMATION: /label= unknown 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 186 

(D) OTHER INFORMATION: /label= unknown 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 331

(D) OTHER INFORMATION: /label= unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129:

TACCCCGACG GGGACCCCAC CGAGAAGCAC CAAGCGACGC TGCTGGAGTT CGGAAAGACC 60 
GAGAACGGCA GGCCCACCAC CGCCATGGCC CTCACCGTTG GGGTACCGGC AGCGATAGGA 120 
GCCCTGCTCT TGCTCCAGAA CAAGGTCCAG AGGAAAGGGG TGATCCGGCC TNTGGAACCG 180 
GAGATNTACA TCCCTGCGCT GGAGATCTTG GAAGCGTCGG GCATCAAGCT GATCGAGAGA 240 
GTGGAGACCT GAGGATGTCA GGATGGGATG AGAATCTATC GAGTATATAT GCTGCAGCAA 300 
CAGAGGCAGT GAGTAAATAA AATGATGATT NTCGCCGTTG TAAGTAAAAT GAGTGGACTG 360 
TATGTATGTA TGTGACTATC TATTGTACTA CATATATACC AAATCTGTCG CCGGTTGATT 420 


(2) INFORMATION FOR SEQ ID NO:130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Triticum aestivum 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 


Tyr Pro Asp Gly Asp Pro Thr Glu Lys His Gln Ala Thr Leu Leu Glu
1 5 10 15

Phe Gly Lys Thr Glu Asn Gly Arg Pro Thr Thr Ala Met Ala Leu Thr
20 25 30

Val Gly Val Pro Ala Ala Ile Gly Ala Leu Leu Leu Leu Gln Asn Lys
35 40 45

Val Gln Arg Lys Gly Val Ile Arg Pro Xaa Glu Pro Glu Xaa Tyr Ile
50 55 60

Pro Ala Leu Glu Ile Leu Glu Ala Ser Gly Ile Lys Leu Ile Glu Arg
65 70 75 80

Val Glu Thr


(2) INFORMATION FOR SEQ ID NO:131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:131: 

ATGACGAAAA AATCAGGTGT TTTGATTCTT GGTGCTGGAC GTGTGTGTCG CCCAGCTGCT 60 
GATTTCCTAG CTTCAGTTAG AACCATTTCG TCACAGCAAT GGTACAAAAC ATATTTCGGA 120 
GCAGACTCTG AAGAGAAAAC AGATGTTCAT GTGATTGTCG CGTCTCTGTA TCTTAAGGAT 180 
GCCAAAGAGA CGGTTGAAGG TATTTCAGAT GTAGAAGCAG TTCGGCTAGA TGTATCTGAT 240 
AGTGAAAGTC TCCTTAAGTA TGTTTCTCAG GTTGATGTTG TCCTAAGTTT ATTACCTGCA 300 
AGTTGTCATG CTGTTGTAGC AAAGACATGC ATTGAGCTGA AGAAGCATCT CGTCACTGCT 360 

AGCTATGTTG ATGATGAAAC GTCCATGTTA CATGAGAAGG CTAAGAGTGC TGGGATAACG 420 

ATTCTAGGCG AAATGGGACT GGACCCTGGA ATCGATCACA TGATGGCGAT GAAAATGATC 480 

AACGATGCTC ATATCAAAAA AGGGAAAGTG AAGTCTTTTA CCTCTTATTG TGGAGGGCTT 540 

CCCTCTCCTG CTGCAGCAAA TAATCCATTA GCATATAAAT TTAGCTGGAA CCCTGCTGGA 600 
GCAATTCGAG CTGGTCAAAA CCCCGCCAAA TACAAAAGCA ACGGCGACAT AATACATGTT 660 
GATGGGAAGA ATCTCTATGA TTCCGCGGCA AGATTCCGAG TACCTAATCT TCCAGCTTTT 720 
GCATTGGAGT GTTTTCCAAA TCGTGACTCC TTGGTTTACG GGGAACATTA TGGCATCGAG 780 
AGCGAAGCAA CAACGATATT TCGTGGAACA CTCAGATATG AAGGGTTTAG TATGATAATG 840 
GCAACACTTT CGAAACTTGG ATTCTTTGAC AGTGAAGCAA ATCAAGTACT CTCCACTGGA 900 
AAGAGGATTA CGTTTGGTGC TCTTTTAAGT AACATTCTAA ATAAGGATGC AGACAATGAA 960 
TCAGAGCCCC TAGCGGGAGA AGAAGAGATA AGCAAGAGAA TTATCAAGCT TGGACATTCC 1020 
AAGGAGACTG CAGCCAAAGC TGCCAAAACA ATTGTATTCT TGGGGTTCAA CGAAGAGAGG 1080 
GAGGTTCCAT CACTGTGTAA AAGCGTATTT GATGCAACTT GTTACCTAAT GGAAGAGAAA 1140 
CTAGCTTATT CCGGAAATGA ACAGGACATG GTGCTTTTGC ATCACGAAGT AGAAGTGGAA 1200 
TTCCTTGAAA GCAAACGTAT AGAGAAGCAC ACTGCGACTC TTTTGGAATT CGGGGACATC 1260 
AAGAATGGAC AAACAACAAC CGCTATGGCC AAGACTGTTG GGATCCCTGC AGCCATTGGA 1320 
GCTCTGGTGT TAATTGAAGA CAAGATCAAG ACAAGAGGAG TCTTAAGGCC TCTCGAAGCA 1380 
GAGGTGTATT TGCCAGCTTT GGATATATTG CAAGCATATG GTATAAAGCT GATGGAGAAG 1440 
GCAGAATGA 1449



(2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132:

Met Thr Lys Lys Ser Gly Val Leu Ile Leu Gly Ala Gly Arg Val Cys
1 5 10 15

Arg Pro Ala Ala Asp Phe Leu Ala Ser Val Arg Thr Ile Ser Ser Gln
20 25 30

Gln Trp Tyr Lys Thr Tyr Phe Gly Ala Asp Ser Glu Glu Lys Thr Asp
35 40 45

Val His Val Ile Val Ala Ser Leu Tyr Leu Lys Asp Ala Lys Glu Thr
50 55 60

Val Glu Gly Ile Ser Asp Val Glu Ala Val Arg Leu Asp Val Ser Asp
65 70 75 80

Ser Glu Ser Leu Leu Lys Tyr Val Ser Gln Val Asp Val Val Leu Ser
85 90 95

Leu Leu Pro Ala Ser Cys His Ala Val Val Ala Lys Thr Cys Ile Glu
100 105 110

Leu Lys Lys His Leu Val Thr Ala Ser Tyr Val Asp Asp Glu Thr Ser
115 120 125

Met Leu His Glu Lys Ala Lys Ser Ala Gly Ile Thr Ile Leu Gly Glu
130 135 140

Met Gly Leu Asp Pro Gly Ile Asp His Met Met Ala Met Lys Met Ile
145 150 155 160

Asn Asp Ala His Ile Lys Lys Gly Lys Val Lys Ser Phe Thr Ser Tyr
165 170 175

Cys Gly Gly Leu Pro Ser Pro Ala Ala Ala Asn Asn Pro Leu Ala Tyr
180 185 190

Lys Phe Ser Trp Asn Pro Ala Gly Ala Ile Arg Ala Gly Gln Asn Pro
195 200 205

Ala Lys Tyr Lys Ser Asn Gly Asp Ile Ile His Val Asp Gly Lys Asn
210 215 220

Leu Tyr Asp Ser Ala Ala Arg Phe Arg Val Pro Asn Leu Pro Ala Phe
225 230 235 240

Ala Leu Glu Cys Phe Pro Asn Arg Asp Ser Leu Val Tyr Gly Glu His
245 250 255

Tyr Gly Ile Glu Ser Glu Ala Thr Thr Ile Phe Arg Gly Thr Leu Arg
260 265 270

Tyr Glu Gly Phe Ser Met Ile Met Ala Thr Leu Ser Lys Leu Gly Phe
275 280 285

Phe Asp Ser Glu Ala Asn Gln Val Leu Ser Thr Gly Lys Arg Ile Thr
290 295 300

Phe Gly Ala Leu Leu Ser Asn Ile Leu Asn Lys Asp Ala Asp Asn Glu
305 310 315 320

Ser Glu Pro Leu Ala Gly Glu Glu Glu Ile Ser Lys Arg Ile Ile Lys
325 330 335

Leu Gly His Ser Lys Glu Thr Ala Ala Lys Ala Ala Lys Thr Ile Val
340 345 350

Phe Leu Gly Phe Asn Glu Glu Arg Glu Val Pro Ser Leu Cys Lys Ser
355 360 365

Val Phe Asp Ala Thr Cys Tyr Leu Met Glu Glu Lys Leu Ala Tyr Ser
370 375 380

Gly Asn Glu Gln Asp Met Val Leu Leu His His Glu Val Glu Val Glu
385 390 395 400

Phe Leu Glu Ser Lys Arg Ile Glu Lys His Thr Ala Thr Leu Leu Glu
405 410 415

Phe Gly Asp Ile Lys Asn Gly Gln Thr Thr Thr Ala Met Ala Lys Thr
420 425 430

Val Gly Ile Pro Ala Ala Ile Gly Ala Leu Val Leu Ile Glu Asp Lys
435 440 445

Ile Lys Thr Arg Gly Val Leu Arg Pro Leu Glu Ala Glu Val Tyr Leu
450 455 460

Pro Ala Leu Asp Ile Leu Gln Ala Tyr Gly Ile Lys Leu Met Glu Lys
465 470 475 480

Ala Glu